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The press under pressure 


With the Leveson inquiry scrutinizing journalistic practice in the United Kingdom, scientists 
should take the opportunity to fight back against agenda-driven reporting. 


researchers in Britain probably have more than most. With sto- 

ries ranging from ludicrous (wind turbine attacked by aliens) to 
downright irresponsible (promoting the link between childhood vacci- 
nations and autism), the fourth estate in the United Kingdom has hardly 
covered itself in glory when it comes to science and scientific issues. 

Other countries have similar grievances, of course — particularly 
the United States, where right-wing talk radio and cable television 
regularly air anti-science views on everything from global warming 
to creationism. Stem-cell scientists in Germany and transgenic-crop 
researchers in France have also been assailed by journalism out of 
step with the scientific evidence that it claims to examine. But there 
is a sense that the situation is more acute in tabloid-driven Britain, 
particularly given the distasteful news-gathering techniques that are 
now under the microscope like never before. 

In Britain, eyes are on an inquiry into the standards and ethics of 
the press, headed by Lord Justice Brian Leveson. Widely known as the 
‘phone-hacking inquiry, triggered as it was by revelations about the 
extent of illegal eavesdropping at the now-defunct News of the World 
newspaper, the judicial investigation in fact has a much wider scope. In 
his opening remarks, Robert Jay QC, counsel to the inquiry, said that 
he expected members of the scientific community to submit evidence 
that sections of the press were causing real harm by not basing their 
commentaries on evidence and not applying the scientific method to 
their reports. But his remarks seem to have surprised many within 
the community he was referring to — a subsequent search by the Sci- 
ence Media Centre, an advocacy group in London, found just a single 
planned submission, and it now intends to send its own. 

This is a wasted opportunity. Too often, talk about the difficult rela- 
tionship between the media and science gets bogged down in well- 
meaning but ultimately naive discussion of how to ‘help’ reporters 
to get their facts straight. Should journalists send stories to scientists 
to be vetted before publication? Should they have scientific training? 
Should scientists be trained to offer sound bites? 

Alastair Campbell, the former communications chief for the UK gov- 
ernment and a man who knows a thing or two about sound bites, better 
characterized the problem when he told the Leveson inquiry about 
“agenda-driven journalism regardless of facts”. Campbell was referring 
to the media fixation with autism and childhood vaccinations, but it 
could have been anything from climate change (on which positions 
range from ‘it’s not dangerous to ‘it’s going to kill us all’) or volcanic 
ash and aviation (it’s not dangerous) to bird flu (it’s going to kill us all). 

Sometimes the agenda is obvious and explicitly political. More 
often, it is the instinctive overreach of a story-teller who chooses what 
to include to make their tale as interesting as possible. Either way, the 
problem runs deeper than reporters regularly confusing bacteria and 
viruses, however irritating that may be to some. Witness the list of the 
top 100 UK political journalists of 2011, as decided by Total Politics 


veryone has an example of the scientific ignorance of the press, but 


magazine last week. In at number 14 is Christopher Booker, a column- 
ist for The Sunday Telegraph. Not satisfied with Booker being “skeptical 
about global warming’, the magazine partly credits his high rank to 
his claims that evidence to prove that passive smoking and exposure 
to asbestos cause cancer “does not exist”. 

Journalism that favours attitude over accuracy is more common 
than scientists suspect, and not just on the comment pages or in 
the tabloids. And it is also more damaging — with news editors 
behind the scenes ordering certain lines 


‘J ournalism that oy high-profile stories, no matter what the 
favours attitude science says. 

over accuracy The evidence offered to Lord Leveson 
ismorecommon shows that science is far from alone in this 
than scientists treatment: parents of missing and murdered 
suspect.” children have queued up to tell harrowing 


stories of blatant misrepresentation, on top 
of the indignity of having voicemails hacked. 

But science has a way to respond that others do not. Through online 
forums, blogs and Twitter, a cottage industry has grown up around 
instant criticism of dodgy scientific claims and dubious findings. This 
parallel journalism is increasingly coming to the attention of the main- 
stream press — as demonstrated by the rising number of stories in the 
press that were first broken by blogs. 

It may seem thankless at times, but the army of online commenta- 
tors who point out the errors, the inconsistencies and the confounding 
factors, and from time to time just scream ‘bullshit, have the power to 
hold the press to account. This ongoing war of attrition against those 
who would put their own agendas above the facts cannot take away 
their platform, but it can chip away at something they prize even more: 
their relevance, and with it their pernicious influence. = 


Troubled waters 


Protected areas are only the start of the road to 
reforming our relationship with the seas. 


simply to declare it as such. The land is then designated as 

protected, and counts towards the relevant government's tar- 

gets to set aside a certain amount of its territory from development. 

That is a ridiculous example, of course, and would never happen on 
land — so why do we allow a similar exercise to happen in the sea? 

No one should doubt that our seas need protection. Overfishing, 

pollution and climate change are fundamentally changing some of the 
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most important regions on the planet. And the conservation response 
— marine protected areas (MPAs) — should bea key tool to safeguard 
the world’s maritime environment. By setting aside areas in which 
human activity is tightly regulated, the thinking goes, governments 
can ensure that key habitats and species are preserved. 

Certainly, good MPAs such as those in Hawaii and around the 
Great Barrier Reef can bring great benefits, from saving individual 
species from extinction to preserving entire coral-reef or deep-sea 
ecosystems. And the number of these areas is on the rise, as the world’s 
governments crawl towards a target to protect 10% of the oceans. 

But to get the most from these areas, much greater efforts are 
needed. In the sea, as on land, balancing the recommendations of 
scientists with commercial needs and other interests is always tricky 
for politicians. Still, as highlighted on page 166, and as shown by recent 
developments in Australia (see Nature 480, 14-15; 2011), there are 
growing concerns about the poor science base of many of the existing 
and proposed MPAs. 

As Australia divides up its waters into a patchwork of protected, 
recreational, commercial and multipurpose areas, researchers say 
that the country is ignoring evidence that would make this zon- 
ing stronger. These concerns, which surfaced over proposals for 
the southwest region, again in the northwest region and then most 
recently in the plan for a huge reserve in the Coral Sea, centre on 
whether enough sea has been given the highest level of protection 
and whether the areas that are highly protected include a repre- 
sentative selection of all necessary habitats. The second question in 


particular is one on which the voices of researchers must be heard 
by government, and acted on. 

And even when reserves are established on the basis of sound 
science, governments must ensure that they can protect the areas 
properly. It is telling that even the prosperous United States may be 

failing in this. Jane Lubchenco, the head of 


“Declarations the US National Oceanic and Atmospheric 
of protection Administration, says: “We don't have the 
are worthless resources that we need to actually monitor, 
without actions enforce and understand these areas.” If the 


and, crucially, 
without money.” 


United States is having trouble, what hope is 
there for the less-developed nations? 

In the sea, as on land, declarations of pro- 
tection are worthless without actions and, crucially, without money. 

Equally, governments must not fall into the trap of thinking that 
designating and protecting a certain percentage of our seas will solve 
all maritime conservation problems. Larger efforts to reform our 
approach to the oceans, from fisheries to extraction, are long over- 
due. Efforts in New England waters have demonstrated that it might 
be possible to manage local fisheries sustainably (see Nature 465, 
540-542; 2010), and international agreements — such as those that 
have safeguarded Antarctica — show that countries can pull together 
under the right circumstances. For too long what has been out of sight 
has also been out of mind. The rise of MPAs should be a starting point 
for a reformation, not a tool to hit a target so that we say ‘job done’ and 
move on to other issues. m 


A massive comfort 


There are bigger things in the Universe than 
our earthly worries. Much bigger. 


distraction from everyday life. Some see their futures in the 

stars and others talk to the man in the Moon. But for the 
really big problems, you need a really big distraction. So for readers 
who feel especially gloomy, perhaps about the latest bout of politi- 
cal shadowboxing on global warming in Durban, South Africa, or 
the perilous state of European finances, let Nature introduce you to 
two newly identified black holes. These are not the ordinary, two- 
a-penny stellar black holes, mind, but their supermassive relatives 
— the big beasts of the cosmos. 

On page 215 of this issue, astronomers from across the United States 
present their discovery of the two most massive black holes ever found. 
The larger of the pair would tip the scales at some ten billion times the 
mass of our own Sun. Heavy stuff indeed. 

In some respects, it is surprising that these black holes have 
managed to hide for so long. The curious twinkling of some qua- 
sars dated to the earliest days of the Universe had made astronomers 
pretty sure that the giants were out there somewhere, yet the previous 
heavyweight champion, in the giant elliptical galaxy Messier 87, isa 
relatively puny 6.3 billion solar masses. 

Measuring the motion of stars from two different galaxies that 
are the biggest in their clusters, NGC 3842 and NGC 4889, Nicholas 
McConnell of the University of California, Berkeley, and his colleagues 
found that NGC 3842 has a central black hole with a mass of 9.7 billion 
solar masses, and NGC 4889 has a black hole about the same size or 
even larger. (All massive galaxies with a spheroidal component, such 
as the bulge of the Milky Way, are thought to harbour a supermassive 
black hole at their centre.) 

As celestial inspirations go, it is true that the giant twins of dark- 
ness at the heart of NGC 3842 and NGC 4889 are somewhat distant 


Pp eople have always looked to the heavens for inspiration and 
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companions. The nearest is about 98 megaparsecs from Earth — about 
320 million light years. And it is true that McConnell’s team was able to 
look to the sky with more than a shrug of resignation — the research- 
ers could use integral-field spectrographs at the giant Gemini North 
and Keck telescopes perched on Hawaiian mountains. But what these 
supermassive black holes lack in touchy-feely access, they surely gain 
in being really, really big. And if it helps those of us weighed down by 
terrestrial matters, they are out there, somewhere. 

As supermassive black holes go, they are perhaps just a little too 
supermassive, certainly a bit larger than astronomers would have pre- 
dicted given what they know about their host galaxies. Perhaps the 
processes that influence the growth of the largest galaxies and their 
black holes are somehow different from those in smaller galaxies? As 
the authors note, better adaptive optics instruments on telescopes and 
very-long-baseline radio interferometry are finding black holes in ever 
more exotic galaxies, and will help to answer that question. 

It now seems plausible that, within a decade or so, astronomers 
will be able to capture the first direct image of a black hole. Plans 
are afoot to link a network of instruments, stretching from the high 
desert plains of Chile and the South Pole to potential sites in Africa 
and New Zealand. Pointed at the centre of our own galaxy, sucha 
telescope should be able to see SgrA*, the supermassive black hole 
believed to sit there. Astronomers already have an image in mind: 
a bright ring spun around a dark shadow cast by the black hole, 
because no light escapes its event horizon. 

Widely described as the point of no return, the event horizon of 
SgrA* would be the largest in our skies, but still just 30 microarcsec- 
onds across — the apparent size of a tennis ball on the Moon when 
viewed from Earth. To capture its image would be a stunning technical 
achievement in itself, but it would also open the door to further studies 
of how black holes spin and gather material, as well as probing some 
fundamental aspects of space-time and general relativity. And the first 
picture taken of our local supermassive black hole — the most enig- 
matic and charismatic of all the wonders of the 
Universe — would surely be one of the defining 
images of the time. It might even knock every- 
day trouble and strife from the front pages, and 
perhaps even, for a while, from people’s minds. m 
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Europe budget rise 


The European Commission 
has proposed an €80-billion 
(US$107.5-billion) budget for 
Europe’ next research funding 
programme, which runs from 
2014 to 2020. That is a 60% 
rise on the current programme 
(running 2007-13), but 
individual sectors get larger 
funding boosts: the European 
Research Council, a grants 
agency for basic research, 

is in line for a 77% rise 

to €13.2 billion. Europe’s 
research commissioner, 

Maire Geoghegan-Quinn, 
emphasized that stem-cell 
research would be supported, 
assuaging fears that it might 
not after Europe’s Court of 
Justice banned patents on 
human embryonic stem cells 
in October. The proposals, 
released on 30 November, 
will now be discussed by the 
European parliament and by 
member states, with the aim 
of reaching an agreement by 
the end of 2013. See go.nature. 
com/hxmqua for more. 


UK medical boost 


British scientists may gain 
wider access to data from 
patients in the country’s 
National Health Service, the 
government announced on 

5 December. It is consulting 

on plans for patient data to 

be automatically ‘opted in’ to 
clinical research, which could 
include research by private 
companies. The consultation is 
part of a package of measures 
to boost Britain's life-sciences 
industry, including a fund 
(half of it new spending) worth 
£180 million (US$282 million) 
to speed drugs from lab to 
clinic. See go.nature.com/ 
qe6wmp for more. 


ITER budget saved 


After more than a year of 
deadlock, European policy- 
makers have agreed a deal 


Durban debates climate treaty 


Delegates at the United Nations climate talks 
in Durban, South Africa, are asking afresh 
whether the world needs a new legally binding 
climate treaty when the Kyoto Protocol expires 
next year. Campaigners rallying in the city 
(pictured) urged a breakthrough, but none 
had arrived during the first week of talks. 


to plug a €1.2-billion 
(US$1.6-billion) funding gap 
in the 2012-13 budget of ITER, 
the €15-billion experimental 
fusion reactor being built 

near Cadarache, France. 

The agreement redirects 

€840 million from other areas 
of European spending, and 
importantly avoids raiding 
Europe’ research budget, 
which ministers had threatened. 
to do. The remaining 

€360 million will come from 
as-yet-unspecified areas of 
Europe's 2013 budget, policy- 
makers agreed. See go.nature. 
com/qjnxy3 for more. 


Plutonium to burn 
Hoping to get rid of its 
112-tonne stockpile 

of plutonium, the UK 
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government said last week 

that it was considering 
building a plant to recycle the 
plutonium into mixed oxide 
fuel (MOX). The move comes 
just four months after the 
Nuclear Decommissioning 
Authority announced the 
closure of the country’s only 
nuclear reprocessing facility, 

at Sellafield (see Nature 476, 
140; 2011). That plant (one of 
only two commercial MOX 
facilities in the world) has been 
beset with technical problems. 
But anew MOX reprocessing 
plant is not the only option: 

on 30 November, GE Hitachi 
Nuclear Energy, based in 
Wilmington, North Carolina, 
proposed building a fast reactor 
fuelled by plutonium. See 
go.nature.com/Irsqza for more. 


Negotiators did at least make some progress 

on initiatives to curb deforestation, striking 

a technical deal that lays out ground rules by 
which countries would report baselines for rates 
of deforestation, and would submit these to 
international peer review. See go.nature.com/ 
gmzoab for the latest blogs from Durban. 


| RESEARCH 
Elements named 


Pencil two new names into 
the periodic table: elements 
114 and 116 should be 

called flerovium (Fl) and 
livermorium (Lv), the 
International Union of Pure 
and Applied Chemistry 
(IUPAC) proposed on 

1 December. The names 
honour the institutes that 
collaborated to create 

the artificial elements: 
livermorium for the 
Lawrence Livermore National 
Laboratory in California, 

and flerovium for Georgy 
Flerov, founder of the Flerov 
Laboratory of Nuclear 
Reactions at the Joint Institute 
for Nuclear Research in 


S. VAN ZUYDAM/AP 


#* Dubna, Russia. (Dubnium is 


I/CORB 


S 


already an element.) IUPAC 
will allow five months 


& for public comment, but 
2 the names are likely to be 
& confirmed next May. See 
~ go.nature.com/2wvwkc 


SOURCE: NATURE CLIMATE CHANGE 


for more. 


Earth-like planet 
Kepler, NASAs planet- 
hunting space telescope, has 
announced its first potentially 
habitable planet around a star 
like our Sun. Kepler-22b is 

not an Earth twin: at 2.4 times 
Earth’s radius, it could be more 
like a gassy Neptune than a 
rocky planet. But it sits in a 
comfortably warm orbit from 
its host star, at temperatures 
similar to those experienced 
by Earth. The planet — 
announced on 5 December — 
is 184 parsecs (600 light years) 
from us. Kepler’s full catalogue 
now includes 2,326 planet 
candidates. See go.nature.com/ 
xc6we6 for more. 


Stapel retraction 
Dutch social psychologist 
Diederik Stapel, who admitted 
faking data in numerous 
published papers, last week 
retracted a 2011 study 

that suggested disorderly 
environments encourage 
discrimination (D. A. Stapel 
and S. Lindberg Science 332, 
251-253; 2011). It is the first 
in what is likely to be dozens 
of retractions. Committees at 
the universities of Amsterdam, 


TREND WATCH 


The sharp rise of carbon 
emissions from 2009 to 2010 — 
by nearly 6% — suggests that the 
global financial crisis has had 
little impact in slowing down 

the world’s output of greenhouse 
gases, according to an analysis 
published on 4 December 

(G. P. Peters et al. Nature Climate 
Change http://doi.org/g2m; 
2011). The rise was mainly due 

to high emissions growth in a few 
key economies. China’s emissions 
were up by 10.4%, India’s by 

9.4% and Brazil's by 11.6%. See 
go.nature.com/bvlgbj for more. 


Groningen and Tilburg, where 
Stapel studied and worked 
between 1994 and 2011, aim 
to identify tainted papers for a 
report that will not be finished 
until mid-2012. See go.nature. 
com/8lv4n2 for more. 


Warmest La Niiia 


This year is set to be the 

tenth warmest since records 
began in 1850, the World 
Meteorological Organization 
(WMO) said on 29 November. 
But it will also be the hottest 
year to have experienced 

a strong La Nifa event — 

the natural cycle in which 
temperature falls over vast 
areas of the sea surface in the 
equatorial east-central Pacific 
Ocean. The WMO used figures 
up to October from three data 
sources to make its projection. 


Open science apps 


A website where people can 
upload their genetic and 
phenotypic information has 
come first in a contest to 

build applications to make 
science more open. OpenSNP 
(opensnp.org) won the 
inaugural API Binary Battle, 
sponsored by open-access 
publisher the Public Library of 
Science and the paper-sharing 
site Mendeley. See go.nature. 
com/wdw9e¢w for more. 


Deforestation down 
Deforestation in the Brazilian 
Amazon rainforest has fallen to 
a record low for the third year 


CARBON SHRUGS OFF CRISIS 


running, according to satellite 
data released on 6 December 
by Brazil's National Institute 
for Space Research (INPE) 

in Sao José dos Campos. It 
seems that strong government 
enforcement countered forest 
clearing after sudden spikes 

in deforestation (pictured) 
were reported from March 

to May. In August, the INPE 
had reported a 15% rise in 
deforestation for the year, 
which runs from August 2010 
to July 2011. But that report was 
based on coarser-resolution 
satellite imagery. See go.nature. 
com/pxpkyg for more. 


Arctic oil setback 
Cairn Energy, a company that 
is leading efforts to find oil and 
gas off the coast of Greenland, 
announced on 30 November 
that it is plugging and 
abandoning the final wells 

it drilled this year. The firm, 
based in Edinburgh, UK, has 
failed to find any commercially 
viable reserves after two 

years drilling eight offshore 


Global carbon emissions seem to have recovered faster from the 
global financial crisis than from previous economic crises. 
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SEVEN DAYS | THIS WEEK | 


9 DECEMBER 
International climate 
negotiations in Durban, 
South Africa, conclude. 
See go.nature.com/ 
dw9pu2 for the latest 
from the meeting. 


13 DECEMBER 
Latest results from the 
Large Hadron Collider 
will be revealed at a 
CERN public seminar 
in Geneva, Switzerland. 
See go.nature.com/ 
lyy96r for rumours 
before the event. 
go.nature.com/rqgjuc 


exploration wells at a reported 
cost of about US$1 billion. 
Market analysts speculated that 
the company might give up 
drilling in 2012, although other 
companies, including Shell, 
Exxon, Chevron and Statoil, 
hold licences to explore off 
Greenland. 


Meltdown admitted 


The operators of Japan's 
stricken Fukushima Daiichi 
nuclear plant have made their 
direst assessment yet of the 
meltdown that took place in 
March. In the most damaged 
reactor, Unit 1, all the fuel 
melted through the steel 
pressure vessel and seeped into 
(but not through) the concrete 
base of the containment vessel 
below, the Tokyo Electric 
Power Company (TEPCO) 
admitted on 30 November. 
TEPCO had previously said 
that only some of the fuel 

had melted, but its latest 
simulations fall into line with 
models run by observers such 
as the US Nuclear Regulatory 
Commission. Leaks of 
radioactive water continue 

at the site: on 4 December, 
TEPCO said that it was using 
sandbags to contain a new leak 
from a building in which water 
was being decontaminated. 


> NATURE.COM 
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Cooking up a 
model of Earth’s core in the 
laboratory p.162 


excellence at the University 


Will cuts sacrifice 
of California’? p.164 


new ship p.166 


China’s 
ambitions ride a spanking 
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US President Barack Obama last week pledged to fund HIV treatment for millions more people worldwide over the next two years. 


Cutbacks threaten HIV gains 


Wealthy nations have reduced contributions to global research and treatment programmes. 


BY MEREDITH WADMAN 


r | Mhirty years after AIDS was first recog- 
nized as a human scourge, major recent 
gains in treatment and prevention risk 

being derailed by the global financial crisis. 

On 23 November, the Global Fund to Fight 
AIDS, Tuberculosis and Malaria announced 
that it will not fund new grants for prevention 
and treatment until 2014, owing to “substantial 
budget challenges in some donor countries”. 
The fund's HIV activities run the gamut from 
counselling and testing pregnant women in 
India to providing medications to infected 
children in Kenya. 

And on 7 December, a report from Policy 
Cures in Sydney, Australia, a group that mon- 
itors global research and development for 
neglected diseases, showed that public and pri- 
vate funders last year cut their commitments 
to HIV/AIDS research targeted at the devel- 
oping world by US$67.5 million, or 5.9%. The 
decline was due entirely to cutbacks by wealthy 


nations, which slashed spending by $72.6 mil- 
lion; poorer countries actually increased 
funding by $5.1 million. Where research is 
concerned, “AIDS had a bigger drop in dol- 
lar terms than any other disease”, says Mary 
Moran, the executive director of Policy Cures 
and the lead author on the report. 

The threat comes even as US President 
Barack Obama last week promised that the 
United States will step up its worldwide attack 
on AIDS. Breaking the White House’ past reti- 
cence on the issue, he urged nations to honour 
their unmet pledges to the Global Fund. 

“Countries that have committed to the 
Global Fund need to give the money that they 
promised,” he said on 1 December, World 
AIDS Day. “Countries that haven't made a 
pledge, they need to do 
so,’ he added, singling 
out emerging nations 
such as China that are 
recipients of funds “but 
now are in a position to 


For more, see 
Nature's Outlook on 
HIV/AIDS: 


step up as major donors”. 

The United States has given $6.1 billion 
to the Global Fund since 2004, and last year 
pledged to contribute $4 billion between 2011 
and 2013. Congress approved $1.05 billion in 
2011, meaning that allotments must increase 
substantially in 2012 and 2013 to honour the 
pledge. Obama has asked Congress to provide 
$1.3 billion in 2012, but Senate lawmakers have 
so far resisted the increase. 

Obama pledged to increase by 50% — to 
6 million — the number of people receiving 
antiretroviral therapy (ART) worldwide by 
2013 through the President’s Emergency Plan 
for AIDS Relief (PEPFAR), the nation’s major 
global treatment and prevention programme. 
Healso said PEPFAR would aim to provide ART 
to an additional 1.5 million pregnant women 
with HIV in the next two years. The interna- 
tional promises came with no new money 
immediately attached. But Obama has asked 
Congress to provide nearly $7.2 billion — a 6% 
increase — for PEPFAR in 2012, part of the 
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> unfinished budget bills still being debated. 

Advocates said that his speech marked 
a significant turning point. “This is the first 
time he’s signalled that he’s going to champion 
global HIV in this way,” says Jennifer Cohn, 
an assistant professor of infectious diseases at 
the University of Pennsylvania in Philadelphia, 
and a policy adviser in Nairobi for Médecins 
Sans Frontiéres (MSF; also known as Doctors 
Without Borders). “Whether or not this gets 
translated into the president’s 2013 budget 
request, or what Congress chooses to protect 
during the 2012 budget negotiations — that’s 
what I’m waiting to hear.’ 

Even if the United States manages to expand 
treatment for HIV/AIDS, research and devel- 
opment aimed at poor countries is suffering. 
Moran points out that of the $67.5 million cut 
from HIV/AIDS research spending in 2010, 
$32 million was due to cuts by the US National 
Institutes of Health, and a further $17 million 
was cut by the UK Department for International 
Development (see ‘Fighting a killer’). Other 
neglected diseases are seeing similar reductions 
in research funding (see go.nature.com/sgv70)). 

“A lot of this reflects the financial crisis, that’s 
undoubted, Moran adds. “But with HIV, some 
of it also reflects a shift from the more costly 
late-stage vaccine and microbicide trials that 
haven't worked. We are moving back to basic 
research, which is cheaper.’ 

The cutbacks coincide with encouraging 
results on HIV/AIDS treatment. One study 
published in August found that early treat- 
ment of those infected greatly decreases 
sexual transmission to their uninfected part- 
ners, (M. S. Cohen et al. N. Engl. J. Med. 365, 
493-505; 2011). Other recent work suggests 
that ART, although costly, will pay for itself and 
more by increasing labour productivity, reduc- 
ing the need for orphan care and deferring 


FIGHTING A KILLER 


Funding for research and development (R&D) of HIV/AIDS products targeted at developing countries fell last 
year, whereas treatment with antiretroviral therapies has grown rapidly over the past decade. 
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medical treatment costs (S. Resch et al. PLoS 
ONE 6, €25310; 2011). “Not only is [treatment] 
the right thing to do, it’s also the smart thing 
to do,’ says Sharonann Lynch, an HIV policy 
adviser with MSF in New York. 

But the ‘treatment equals prevention’ equa- 
tion invoked by Obama is not selling in all 
quarters. Roger Bate, a health economist and 
HIV expert at the American Enterprise Insti- 
tute, a think tank in Washington DC, says that 


he is not yet persuaded by the studies sug- 
gesting that treatment prevents transmission. 
“There is a lot of effort out there to ignore noise 
in those studies,” he says. 

He adds that, even if treatment does prevent 
transmission, the commitment to treat infected 
people for their lifetimes quickly becomes an 
enormous financial outlay in future years. “Is 
it sustainable to treat even more people under 
current budgets?” he asks. “I don't think it is.” m 
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NANOTECHNOLOGY 


Nano rules fall foul of data gap 


Incomplete nanotoxicology research is hampering efforts to introduce regulation. 


BY EUGENIE SAMUEL REICH 


r | “the science of the very small will lurk in 
many a Christmas gift this year. Nano- 
materials add strength to golf clubs, 

odour resistance to socks and ultraviolet 

protection to cosmetics. But some of their 
properties could also pose health risks, and 
regulations covering their manufacture and use 
have failed to keep pace with the rush to market. 

With sizes measured in mere billionths of 

a metre, nanomaterials have very high sur- 

face area relative to their mass, and this could 

alter their toxicity compared with the same 


material in bulk. Yet many nanomaterials are 
not regulated separately — in part because the 
high-quality nanotoxicity research needed for 
regulation is lacking, as an assessment pre- 
sented at last week’s Materials Research Society 
meeting in Boston, Massachusetts, made clear. 
Groups representing food manufacturers had 
asked Cantox Health Sciences International, a 
consultancy in Mississauga, Canada, to review 
the literature for studies that could shed light 
on the effects of consuming nanoparticles from 
food packaging, in which they may be used to 
improve strength or even detect spoilage. 
Cantox sampled 30 papers describing studies 
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in which nanoparticles were fed to animals or 
applied in vitro to cells of the gastrointestinal 
tract. Of 21 animal studies, the Cantox 
researchers found that 15 failed to attain the 
highest grade on the Klimisch test, a quality- 
control criterion that assesses whether basic 
data, such as the number or gender of animals 
used, have been reported. “Some didn't give the 
dosage. This is a fairly important aspect,’ says 
Jeffrey Card, a toxicologist at Cantox who led 
the study. 

The underlying materials science was 
equally spotty. Of 10 parameters that Cantox 
concluded should be reported in a nanotoxicity 
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> unfinished budget bills still being debated. 
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global HIV in this way,” says Jennifer Cohn, 
an assistant professor of infectious diseases at 
the University of Pennsylvania in Philadelphia, 
and a policy adviser in Nairobi for Médecins 
Sans Frontiéres (MSF; also known as Doctors 
Without Borders). “Whether or not this gets 
translated into the president’s 2013 budget 
request, or what Congress chooses to protect 
during the 2012 budget negotiations — that’s 
what I’m waiting to hear.’ 

Even if the United States manages to expand 
treatment for HIV/AIDS, research and devel- 
opment aimed at poor countries is suffering. 
Moran points out that of the $67.5 million cut 
from HIV/AIDS research spending in 2010, 
$32 million was due to cuts by the US National 
Institutes of Health, and a further $17 million 
was cut by the UK Department for International 
Development (see ‘Fighting a killer’). Other 
neglected diseases are seeing similar reductions 
in research funding (see go.nature.com/sgv70)). 
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that ART, although costly, will pay for itself and 
more by increasing labour productivity, reduc- 
ing the need for orphan care and deferring 
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in which nanoparticles were fed to animals or 
applied in vitro to cells of the gastrointestinal 
tract. Of 21 animal studies, the Cantox 
researchers found that 15 failed to attain the 
highest grade on the Klimisch test, a quality- 
control criterion that assesses whether basic 
data, such as the number or gender of animals 
used, have been reported. “Some didn't give the 
dosage. This is a fairly important aspect,’ says 
Jeffrey Card, a toxicologist at Cantox who led 
the study. 

The underlying materials science was 
equally spotty. Of 10 parameters that Cantox 
concluded should be reported in a nanotoxicity 


study, such as particle size, shape and sur- 
face chemistry, more than half were missing 
from 14 of the animal studies’. 

Yongping Bao of the University of East 
Anglia in Norwich, UK, an author on one 
of the studies” given low marks by Cantox, 
points out that some of the missing data 
appear in other papers by his group. But 
he believes that the review highlights the 
need for government regulatory agencies 
to establish agreed standards for this sort 
of research. “Anyone can set his or her own 
criteria to review nanotoxicity literature, so 
a gold standard is urgently needed,’ he says. 
Nanotoxicity studies arguably require addi- 
tional standards, because nanoparticles can 
vary within a batch much more widely than 
do bulk chemicals, for example. 

Altaf Carim, assistant director of nano- 
technology at the White House Office of 
Science and Technology Policy, says that 
the US National Nanotechnology Initiative’s 
Environmental, Health, and Safety Research 
Strategy’, unveiled in October, will develop 
a set of standards that should help research- 
ers and agencies to plug that gap. “We are 
certainly hoping the broader community is 
going to make use of it,” Carim says. 

Nanotoxicologists have also lacked 
standard reference materials for defining 
nanoparticles. The US National Institute 
of Standards and Technology has pro- 
duced references for gold and polystyrene 
nanoparticles, but has yet to issue similar 
samples of other nanomaterials. 

Later this month, however, the institute 
will release its first batch of standard refer- 
ence carbon nanotubes — the result of three 
years work, according to Kalman Migler, 
who is overseeing the project. Researchers 
will be able to buy vials of the nanotubes 
to calibrate their measurement equipment 
and assess levels of common contaminants 
in their samples. The reference nanotubes, 
the first ever provided by a national stand- 
ards agency, should help nanotoxicologists 
to pinpoint whether any observed adverse 
reactions are the result of nanoparticles 
rather than impurities, for example. 

Yet the latest nanotechnology decree 
from Europe perhaps shows how far regu- 
lation efforts still have to go. On 18 October, 
the European Commission finally adopted 
a definition of a nanomaterial: one in which 
half or more of the particles in a sample 
have one dimension that measures between 
1 and 100 nanometres’. The definition had 
been requested by the European Parliament 
in April 2009. = 
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Novartis to shut brain 
research facility 


Drug giant redirects psychiatric efforts to genetics. 


BY ALISON ABBOTT 


ovartis has joined other pharmaceu- 
| \ | tical companies in abandoning tradi- 
tional drug-discovery programmes 

that pursue treatments for brain disorders. 

Nature has learned that the company is 
closing its neuroscience facility in Basel, 
Switzerland, where Novartis is headquartered. 

It follows similar moves by GlaxoSmith- 
Kline and AstraZeneca, both based in the 
United Kingdom, which last year announced 
the closure of all their neuroscience research 
divisions globally. US-based companies Pfizer 
and Merck, as well as the French company 
Sanofi, have also pulled back on research into 
brain disorders. Rather than abandon neuro- 
science, however, Novartis says it will launch 
new programmes to study the genetics of psy- 
chiatric and cognitive disorders, in the hope of 
identifying new treatment strategies. 

Developing drugs for brain disorders has 
become a high-risk activity, with most can- 
didates failing after years of expensive clinical 
trials. The market is already flooded with 
cheap, generic antidepressants, antipsychot- 
ics and other drugs that act on known targets 
in the brain, mostly neurotransmitter recep- 
tors (see ‘Mental-health drug sales’). This has 
forced companies to look for radically new 
drug targets, but the search is proving difficult 
because little is known about the biology of the 
brain and its disorders. 

“Standard approaches to developing drugs 
for mental health have not reaped significant 
benefit in the past two decades,” says Ken 
Kaitin, director of the Tufts Center for the 
Study of Drug Development in Boston, Massa- 
chusetts. “But it is a dilemma for the companies 
because there is a large and growing market for 
these products.” Mental disorders impose the 
largest disease burden worldwide (see Nature 


MENTAL-HEALTH DRUG SALES 


Treatments for brain disorders represented a huge 
market for the pharmaceutical industry in 2010. 
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478, 15; 2011), and current treatments do not 
work particularly well for most patients. 

The Basel neuroscience department has 80 
staff, and five candidate drugs in the pipeline 
that act on the central nervous system (CNS). 
These drugs will continue to be developed, 
says geneticist Mark Fishman, president of 
research for the company’s main research arm, 
the Novartis Institutes of BioMedical Research 
(NIBR). Novartis is negotiating with unions 
over the terms of the closure, which is likely to 
happen during 2012. 

Fishman says that the NIBR will open a 
new research division to study the genetics of 
psychiatry and cognitive disorders at its site in 
Cambridge, Massachusetts, taking advantage 

of academic strength 


“Developing in the area, including 
drugs for the Broad Institute, 
the brain has run jointly by Har- 
become high- vard University and 
risk, with most the Massachusetts 
candidates Institute of Tech- 


nology, which has 
both major DNA- 
sequencing capacity and a large group dedi- 
cated to psychiatric genetics. “Progress based 
on neurotransmitters has become small and 
incremental,’ Fishman told Nature. “Genetic 
analysis will provide a real scientific opportu- 
nity in psychiatric and cognitive disorders, even 
if new drugs only arrive in the distant future.” 
Recent population-based, genome-wide 
association studies have identified many dif- 
ferent genetic variants that individually confer 
tiny increased risk of disease, but which may 
act together to substantially increase a person's 
susceptibility. A type of defect known as copy 
number variation, in which large chunks of 
DNA are gained or lost, may on its own cause 
a significant increase in risk for psychiatric dis- 
ease. Scientists hope that understanding the > 


failing.” 
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biological function of these risk genes 
may yield new clues about how to prevent 
or treat diseases. 

Companies also hope that they will be 
able to find genetic biomarkers — a par- 
ticular DNA sequence or variation, for 
example — that will allow them to identify 
individuals who are likely to respond to a 
particular drug, says Kaitin. “It’s the basis 
of personalized medicine — health plans 
are more likely to pay for a new drug if they 
are convinced it will work for the person it 
is prescribed for” 

Stephen Stahl, a psychopharmacologist 
and psychiatrist at the University of Califor- 
nia, San Diego, says that other companies 
are also using gene sequencing to look for 
new targets. “GlaxoSmithKline pulled out 
of traditional CNS pharmacology but is 
pursuing research into neurodegeneration 
at its new sequencing centre in Shanghai.” 
It’s a promising approach, he adds, but he 
notes that no one yet knows how success- 
ful it will be. 

The genetic approach has delivered 
encouraging results for a five-year, €20-mil- 
lion (US$27-million) public-private part- 
nership called NEWMEDS, launched in 
2009 by the European Commission to 
speed the development of treatments for 
schizophrenia and depression. NEWMEDS 
coordinator Tine Bryan Stensbol, director 
of research at the Danish pharmaceutical 
company H. Lundbeck in Valby, says that 
scientists in the consortium have used 
genome information collected from the Ice- 
landic population by the Reykjavik-based 
genetics company deCODE to identify a 
copy number variant that they believe will 
be important in understanding the pathol- 
ogy of schizophrenia. The commission has 
awarded NEWMEDS a €1-million top-up 
to investigate further. 

Lundbeck is a medium-sized company 
that remains committed to drug discovery 
in psychiatric disease. It has, for example, 
just signed an agreement worth more than 
a billion dollars with the Japanese com- 
pany Otsuka Pharmaceutical to develop 
and market products for mental disorders. 
“We are certain that the rewards will come 
and I’m glad we will be well-placed to enjoy 
them,” says Stensbol. But she regrets that 
other companies are moving away, as she 
believes that competition can stimulate the 
field: “We need a critical mass of companies 
there when the new targets come.” 

Johnson & Johnson is one of the few major 
international firms keeping the faith. Its 
head of neuroscience, Husseini Manji, who 

is based in New Jersey, 
says that “things are 
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special: the breakthroughs 
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ready to roll 


Two rotating spheres separated by thousands of kilograms 
of liquid sodium aim to mimic Earth’s interior. 


BY SUSAN YOUNG 


he 3-metre-tall ribbed sphere looks like 
T a ‘visitor’ from another planet, dramati- 
cally lit and encased within a sturdy steel 

box. In fact, the giant orb, housed in a cavernous 
warehouse at the University of Maryland, Col- 
lege Park, is meant to approximate Earth's core. 
Ten years in the making, the US$2-million 
project is nearly ready for its inaugural run. 
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Early next year, the sphere will begin whirling 
around while loaded with 13,000 kilograms 
of molten sodium heated to around 105°C. 
Researchers hope that the churning, electri- 
cally conducting fluid will generate a self- 
sustaining electromagnetic field that can be 
poked, prodded and coaxed for clues about 
Earth’s dynamo, which is generated by the 
movement of liquid iron in the outer core. 
If it works, it will be the first time that an 


S.TRIANA 


IN FOCUS 


biological function of these risk genes 
may yield new clues about how to prevent 
or treat diseases. 

Companies also hope that they will be 
able to find genetic biomarkers — a par- 
ticular DNA sequence or variation, for 
example — that will allow them to identify 
individuals who are likely to respond to a 
particular drug, says Kaitin. “It’s the basis 
of personalized medicine — health plans 
are more likely to pay for a new drug if they 
are convinced it will work for the person it 
is prescribed for” 

Stephen Stahl, a psychopharmacologist 
and psychiatrist at the University of Califor- 
nia, San Diego, says that other companies 
are also using gene sequencing to look for 
new targets. “GlaxoSmithKline pulled out 
of traditional CNS pharmacology but is 
pursuing research into neurodegeneration 
at its new sequencing centre in Shanghai.” 
It’s a promising approach, he adds, but he 
notes that no one yet knows how success- 
ful it will be. 

The genetic approach has delivered 
encouraging results for a five-year, €20-mil- 
lion (US$27-million) public-private part- 
nership called NEWMEDS, launched in 
2009 by the European Commission to 
speed the development of treatments for 
schizophrenia and depression. NEWMEDS 
coordinator Tine Bryan Stensbol, director 
of research at the Danish pharmaceutical 
company H. Lundbeck in Valby, says that 
scientists in the consortium have used 
genome information collected from the Ice- 
landic population by the Reykjavik-based 
genetics company deCODE to identify a 
copy number variant that they believe will 
be important in understanding the pathol- 
ogy of schizophrenia. The commission has 
awarded NEWMEDS a €1-million top-up 
to investigate further. 

Lundbeck is a medium-sized company 
that remains committed to drug discovery 
in psychiatric disease. It has, for example, 
just signed an agreement worth more than 
a billion dollars with the Japanese com- 
pany Otsuka Pharmaceutical to develop 
and market products for mental disorders. 
“We are certain that the rewards will come 
and I’m glad we will be well-placed to enjoy 
them,” says Stensbol. But she regrets that 
other companies are moving away, as she 
believes that competition can stimulate the 
field: “We need a critical mass of companies 
there when the new targets come.” 

Johnson & Johnson is one of the few major 
international firms keeping the faith. Its 
head of neuroscience, Husseini Manji, who 

is based in New Jersey, 
says that “things are 


See Nature's cyclical — for those 
schizophrenia who stay the course, 
special: the breakthroughs 


will finally come” = 


namo maker 


ready to roll 


Two rotating spheres separated by thousands of kilograms 
of liquid sodium aim to mimic Earth’s interior. 


BY SUSAN YOUNG 


he 3-metre-tall ribbed sphere looks like 
T a ‘visitor’ from another planet, dramati- 
cally lit and encased within a sturdy steel 

box. In fact, the giant orb, housed in a cavernous 
warehouse at the University of Maryland, Col- 
lege Park, is meant to approximate Earth's core. 
Ten years in the making, the US$2-million 
project is nearly ready for its inaugural run. 
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Early next year, the sphere will begin whirling 
around while loaded with 13,000 kilograms 
of molten sodium heated to around 105°C. 
Researchers hope that the churning, electri- 
cally conducting fluid will generate a self- 
sustaining electromagnetic field that can be 
poked, prodded and coaxed for clues about 
Earth’s dynamo, which is generated by the 
movement of liquid iron in the outer core. 
If it works, it will be the first time that an 


S.TRIANA 


experiment that mirrors the configuration of 
Earth’s interior has managed to recreate such 
a phenomenon. 

“Dynamos are easy to generate in nature,” 
says project leader and experimental geophysi- 
cist Daniel Lathrop. “The same is not true for 
the lab,” Yet simulating this process in the lab- 
oratory will guide our understanding of 
processes that take place 3,000 kilometres 
below ground, at depths that can be probed 
only indirectly — by analysing seismic waves 
that have travelled through Earth's deep inte- 
rior, for example. 

“There's no way for us to measure anything 
near the core directly,’ says Lathrop. “It exceeds 
the deepest well by a factor of 100” 

The University of Maryland set-up consists 
of two concentric spheres. The inner sphere, at 
1 metre across, stands in for Earth's solid inner 
core; the outer sphere the edge of Earth's man- 
tle. The space between the two is filled with liq- 
uid sodium, mimicking the liquid outer core. 
Each sphere is powered by a separate motor so 
that it can rotate independently of the other. By 
spinning the spheres across a range of matched 
and unmatched velocities — up to 4 revolu- 
tions per second for the outer sphere and 12 for 
the inner — Lathrop and his team will study 
how heat and rotation might affect the move- 
ment of the molten iron in Earth’s core. 

But the question that looms largest is 
whether the churning sodium will mimic the 
geodynamo and generate a self-perpetuat- 
ing magnetic field. The experiment will use 
Earth’s natural magnetism as a ‘seed field’ to 
kick-start the process. As this field is dragged 
and stretched by the spinning, conducting 
liquid it will generate electric currents. Those 
currents will then create additional magnetic 
fields that, when sufficiently twisted around, 
can amplify themselves and drive the process 
forward. No one knows whether this feedback 
loop will work, says Lathrop, because “there 
are neither theory nor experiments at these 
parameters”. 

Enthusiasm for the effort is building beyond 
Lathrop’s group. “Everyone in the community 
is waiting with bated breath,’ says Andrew 
Jackson, a geophysicist at the Swiss Federal 
Institute of Technology Zurich. “He’s asking 
questions that we don't know the answer to.” 


NEAR EARTH 
The Maryland experiment expands on earlier 
efforts such as that built by a collabora- 
tion between the Ecole Normale Supérieure 
institutes in Paris and Lyon and the French 
Atomic Energy Commission in Saclay. Built in 
Cadarache, France, the experiment uses spin- 
ning iron discs to force turbulence in liquid 
sodium within a cylin- 


NATURE.COM drical case. In 2007, the 
For a video of the team reported that the 
experimentin combination of motion 
action see: and a seed magnetic 
go.nature.com/guofxe field led to the formation 
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RECREATING EARTH'S CORE IN THE LAB 


Earth’s core is inaccessible, but a sphere filled with liquid sodium at the University of Maryland may uncover some 
of its hidden properties. An instrumentation port in the rotating lid allows researchers to monitor temperature, fluid 
flow and other properties while the inner and outer spheres spin independently of each other. 


Outer Core 
(liquid) 


By 5,150 km 
y 
6,370 km 


Inner core 
(solid) 


ofa dynamo with some resemblance to Earth's 
(M. Berhanu et al. Europhys. Lett. 77, 59001; 
2007). The next step is to explore dynamos in 
set-ups that more closely match reality, says 
Jean-Frangois Pinton, an investigator with the 
French experiment, which is still running. 

“Our experiment is a physicist’s experiment,” 
Pinton says, because it aims only to generate a 
dynamo, regardless of how realistic the set-up 
is. “Dan is after what Earth does.” 

Other groups have also approximated 
planetary shapes in liquid-sodium experi- 
ments, but they haven’t managed to gener- 
ate self-sustaining dynamos. A group at the 
University of Wisconsin-Madison has a 
1-metre-diameter 
spherical experiment 
that doesnt rotate, but 
instead drives turbu- 
lence with internal 
propellers. A group 
in Grenoble, France, 
built a 40-centime- 
tre rotating set-up with a sodium-filled outer 
sphere and a solid inner sphere. Both experi- 
ments have uncovered interesting properties of 
fluid dynamics of conducting liquids, but not 
created a dynamo. 

Lathrop’s latest construction is the fourth 
and largest in a series of sodium-spinners 
built by his group. None has yet generated a 
dynamo, but the size of the new sphere should 
give it a better chance of doing so. 

Lathrop and others hope that their work will 
shed light on how rotational forces in Earth’s 
core deflect flows of electrically conducting 


“Dynamos are 
easy to generate 
in nature. The 
same is not true 
for the lab.” 


Bearing Shaft Instrumentation port 


Outer jacket 
for heating 
and cooling 


liquid into a configuration that produces a 
magnetic field with north and south poles. 
It might also help to explain what triggers 
changes in Earth’s magnetic field such as 
north-south pole reversals, which occur on 
average every few hundred thousand years, 
but can happen after a few tens of thousands 
of years or a few million. The last reversal was 
some 780,000 years ago. Over the past century 
and a half, the magnetic field has weakened by 
about 10%, leading some researchers to sug- 
gest that another pole-reversal is in the offing. 

Although a pole reversal is unlikely to cause 
any major disruptions, the weakening of 
Earth’s magnetic field is “at least mildly alarm- 
ing’, says Lathrop, because the field shields us 
from charged solar particles, which can disrupt 
power grids (see Nature 441, 402-404; 2006). 
“Earth’s magnetic shield is part of what makes 
this planet hospitable,’ says Lathrop. 

The sphere is now nearly three-quarters 
full and is awaiting its final shipment of solid 
sodium, expected next month. Once the last 
barrel of sodium is put in, the team will recon- 
figure the experiment for spinning. 

“We've taken all the steps that we can think 
of to try to get dynamo action, but we don't 
have a detailed prediction about whether or 
not those steps will be successful,” says Daniel 
Zimmerman, a postdoc at Maryland who has 
been with the project since the beginning. 

But Lathrop isn’t worried, saying that a 
large rotating volume of a liquid conductor 
is certain to churn up some surprises. “With 
or without the dynamo action it’s going to be 
interesting,” he says. = 
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EDUCATION 


Student protesters disrupted board meetings at the University of California in November. 


State cuts fuel 
California protests 


Fears raised of lasting damage to public university system. 


BY ERIKA CHECK HAYDEN 


educed services. Shrinking programmes. 

Rese tuition fees. These are the signs 

f an ever more dire budget situation at 

the University of California, which has suffered 
years of state funding cutbacks. 

Discomfort spread to the boardroom on 
28 November, when student protesters at four 
campuses temporarily shut down a teleconfer- 
ence of the University of California regents, the 
body that sets policy and budgets for the ten- 
campus statewide system. The protests were 
sparked in part by an incident on 18 Novem- 
ber, in which a policeman at the Davis campus 
doused student protesters in pepper spray, 
but the latest grievances also included ongo- 
ing hikes in undergraduate tuition fees, which 
have risen by 71% since 2008, and budget cuts 
that could reach US$750 million this year. 

One of the world’s leading public univer- 
sities, the University of California system is 
reeling from the impact of the global financial 
crisis, decades of declining public support for 
education and a state budget crisis, says John 
Douglass, a higher-education researcher at the 
University of California, Berkeley. “The great 
recession has been a huge blow after a million 


small cuts, and now youre seeing a transforma- 
tion of the funding model,” he adds. The system 
increasingly relies on tuition fees, says Douglass. 
This year, for the first time, the University of 
California drew more of its funding from pri- 
vate sources, including fees, than from the state 
(see ‘Getting less for more’). California now pro- 
vides just 11% of the university's $22.5-billion 
budget, down from 30% in the 1980s. 


GETTING LESS FOR MORE 


Despite tuition-fee increases, spending per student at 
the University of California (UC) is now about 80% of 
what it was in 1990, owing to falling state funding. 


{Student MJ UC general If State general 


g fees funds funds 

LPS ope eee ee 

8 

3 

A 

n 

2 1B «4 

€ 

S 

310 

wn 

a 

= 

2 

rn) 

a 1990 1995 2000 2005 2011 
Fon eoG) col OG nT 


164 | NATURE | VOL 480 | 8 DECEMBER 2011 


© 2011 Macmillan Publishers Limited. All rights reserved 


Undergraduates have borne the brunt, fac- 
ing higher fees, fewer courses and larger class 
sizes. Scientific researchers are less directly 
affected because they get much of their fund- 
ing from federal grants and private founda- 
tions. But they are also feeling the effects of the 
squeeze as graduate students face constraints 
on resources and time, and campus services 
and maintenance are cut. 

Philippe Marchand, a graduate student in 
environmental science and policy manage- 
ment at Berkeley, needs to increase the amount 
of paid teaching he does to cover his tuition 
fees. But teaching budgets across the system 
have fallen, so fewer teaching slots are avail- 
able, and he is unsure whether the option will 
be available next semester. But even ifhe can 
teach, it will not solve his problems. “Tm going 
to have to make some difficult choices; if I have 
to teach every semester I may have to suspend 
my research,” he says. Since 2006, the number 
of graduate students in Marchand’s department 
has shrunk by 30%. 

Jessica Smith, a chemistry graduate stu- 
dent at Berkeley, says that ceiling leaks in labs 
have ruined experiments and machinery, 
although one leak was fixed by funnelling the 
water away using rubber tubing. “The biggest 
budget-related impediment to my research is 
our crumbling facilities,” she says. 

The cuts are also making it harder for the 
university to retain staff, says Jeffrey Bluestone, 
executive vice-chancellor of the University 
of California, San Francisco (UCSF), which 
relies on state contributions to pay faculty 
salaries and benefits, and so cannot match the 
packages at some private institutions. Losses 
include Didier Stainier, a geneticist who went 
to the Max Planck Institute for Heart and Lung 
Research in Bad Nauheim, Germany, and 
Christopher Voigt, a synthetic biologist who 
left UCSF this year for the Massachusetts Insti- 
tute of Technology in Cambridge. “In a differ- 
ent world, we could have kept him,” Bluestone 
says of Voigt. 

More broadly, ongoing cuts will hamper the 
system's ability to nurture California’s technol- 
ogy sector, “which is heavily reliant on bring- 
ing in talent from universities’, says Douglass. 

A report on the future of the University 
of California, released in November 2010, 
led campuses to increase enrolment of stu- 
dents from other states, who pay higher fees. 
Another fund-raising option, not yet adopted, 
is to allow the most prestigious campuses, such 
as Berkeley and Los Angeles, to charge higher 
tuition fees than the others. And in state elec- 
tions next year, voters will probably be asked to 
consider boosting taxes, in part to fund higher 
education. 

Douglass says that such measures are unlikely 
to reverse the major declines in state funding. 
The University of California must try to staunch 
the bleeding before its reputation as a success- 
ful alternative to costly private US universities 
is irreparably damaged. m 
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China pushes to rule the waves 


Launch of huge Kexue research vessel could put country at forefront of ocean science. 


BY JANE QIU IN BEIJING 


he most sophisticated ship yet to join 
[Penis research fleet was launched 

last week, marking the latest step in the 
country’s push to boost marine science. 

Dubbed Kexue (which means ‘science’), 
the 100-metre-long vessel dwarfs research 
ships operated by some other oceanography 
powerhouses, such as the United States. 
It “will be one of China’s key research 
vessels in the next couple of decades”, 
said Ding Zhongli, vice-president of the 
Chinese Academy of Sciences (CAS), at 
the launch ceremony on 30 November. 
“It is a state-of-the-art moving labo- 
ratory for fundamental research and 
technological development in marine 
science, especially deep-sea research” 

Kexue, which cost about 550 million 
renminbi (US$86 million), will be able to 
carry 80 people at a top speed of 15 knots 
on voyages of 28,000 kilometres, last- 
ing up to 60 days. It is modelled on the 
United Kingdom's RRS James Cook and 
Norway’s G.O. Sars, says Sun Song, 
director of the CAS Institute of Ocean- 
ology in Qingdao, Shandong province, 
which is in charge of Kexue’s design and 
operation. “It is on a par with these top 
research vessels,” he says. 

Most ocean-going research vessels in 
China have been designed to explore for 
mineral or fuel resources, and although 
CAS and some universities have more 
than half a dozen vessels fitted out for 


scientific studies, few have the capacity | The Kexue research vessel was launched in Wuhan, China. 


to explore much further than China’s 

coastal waters. “The lab space on those ships 
is very limited and facilities are not up to 
scratch,’ says Zhao Meixun, a marine ecolo- 
gist at the Ocean University of China (OUC) 
in Qingdao. For example, Dongfanghong 2, 
an OUC research vessel, lacks the relevant 
sampling and measurement tool to study sea 
water's iron content — a vital nutrient for algae 


> 


MORE 
ONLINE 


Ye 


TOP STORY 


Septin proteins take bacterial 
prisoners go.nature.com/ri7s41 


that absorb carbon dioxide. It also lacks equip- 
ment for detecting trace elements, or carrying 
out complex biological studies. 

With 360 square metres of lab space, Kexue 
can accommodate a wide range of oceano- 
graphic studies, including measurements of 
ocean and atmospheric circulation, water and 
sediment sampling and on-board analyses, 
deep-sea exploration, acoustic studies and 


ocean-floor seismic experiments. It can also 
launch manned submersibles, such as the Jiao- 
long (‘Sea dragor’) craft, which descended to a 
depth of 5,000 metres in July. 

“Kexue is an impressive ship with a lot of 
cool features,” says Shangping Xie, a clima- 
tologist and oceanographer at the University 
of Hawaii in Honolulu. Its propellers, driven 
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by electric motors, are housed inside pods 
that can be rotated in any horizontal direc- 
tion, making the ship much more manoeu- 
vrable than if it had a fixed propeller and 
rudder system, says Sun. The electric motors, 
together with noise- and vibration-reduction 
measures, make the ship particularly quiet — 
ideal for marine-biology research. Kexue is the 
first research vessel in the world to boast these 
sophisticated propulsion pods, which 
together with positioning and motion- 
reduction systems will allow it to main- 
tain a fixed position in high winds and 
rough seas, says Sun. 

Yet the ship could prove too costly 
for some researchers. Unlike countries, 
such as Japan and the United States, that 
bear the costs of scientists’ ship time, 
China will ask scientists to pay a sub- 
stantial portion of the costs of operating 
Kexue out of their own grants, says Xie. 
Weijun Cai, an oceanographer at the 
University of Georgia in Athens, says 
that Kexue should be complemented 
with more small- and medium-sized 
ships that allow small teams of scientists 
to carry out focused research on a lower 
budget. “This is particularly important 
for the majority of young and less well- 
funded researchers,” he says. 

Other researchers say that to realize 
Kexue’s full potential, there should be 
more sharing of data and equipment 
between marine scientists, and bet- 
ter technical training of on-board staff 
to ensure that data sets from different 
missions are comparable. 

On Kexue’s maiden voyage to the 
western Pacific Ocean next summer, scientists 
on board will study marine biodiversity, the 
deep-sea environment and the oceans effect 
on climate in China. Researchers around the 
world are welcome to apply for ship time, says 
Sun. “Kexue will bea platform for joint marine 
research and observation with our foreign 
colleagues in the coming decades.” m 
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Uncertain 
sanctuary 


Nations are racing to establish marine 
protected areas, but it’s not clear whether 
many are living up to the name. 


BY DANIEL CRESSEY 


ormer US President George W. Bush did not garner much 

applause from environmentalists during his eight years in 

the White House, but on 15 June 2006, he gave them some- 

thing to cheer about. Bush signed an order to create the 
Papahanaumokuakea Marine National Monument in Hawaii, then the 
world’s largest ocean conservation area. Spanning about 360,000 square 
kilometres of the Pacific Ocean, the reserve is designed to safeguard 
7,000 marine species including the rare Hawaiian monk seal (Monachus 
schauinslandi), hawksbill (Eretmochelys imbricata) and leatherback tur- 
tles (Dermochelys coriacea) and nearly two dozen other animals on the 
US endangered species list. It also started a race between other nations 
to follow suit with giant reserves of their own. 

Since then, Australia, the United Kingdom and the Pacific nation of 
Kiribati have created still larger ocean conservation zones. And nations 
are discussing plans to make other huge reserves that will encompass 
much of the Southern Ocean and the Sargasso Sea. Worldwide, about 4 
million square kilometres have now been set aside in marine protected 
areas (MPAs). 

Despite that rapid expansion, many of the MPAs fall well short of the 
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grand plans that governments once trumpeted 
to protect the world’s oceans and the species that 
live in them. The 2002 Convention on Biological 
Diversity called on the world to protect 10% of 
the oceans by 2012, but the MPAs created so far cover little more than 1%. 

Studies show that some of the MPAs, suchas Australia’s Great Barrier 
Reef Marine Park, have provided strong protection for crucial ecosys- 
tems’. But scientists fret that many MPAs exist only on paper and offer 
no real benefits to the ecosystems or the people within or near them. 
Some have design flaws, such as protecting the wrong habitats. In others, 
there are no efforts to enforce limits on fishing or other activities within 
the protected areas. “The biggest thing we can do wrong is to claim to 
be protecting, when in fact the design of the place or the nature of the 
management is such that there is no effective protection,’ says Peter 
Sale, a marine scientist and professor emeritus at Canada’s University 
of Windsor in Port Carling, Ontario. 

These problems are likely to grow as countries increasingly join forces 
to develop larger and more expensive international protected areas on 
the high seas. Enforcement and monitoring will be more difficult in 
these places than they are in coastal waters. Some scientists worry that 
these concerns are not being openly discussed. “There's almost a reli- 
gion, which is nurtured by non-governmental organizations and some 
members of the science community, to the extent that critical examina- 
tion of MPAs is exceedingly difficult to get published,” says Sale. 


Australia plans to put 
protections in place for 
the Coral Sea. 


A SEA CHANGE 
The term MPA can cover a vast range of entities, from tiny reserves less 
than a square kilometre in size to Britain’s 544,000-square-kilometre 
Chagos Marine Protected Area, which wraps around 55 small islands 
in the Indian Ocean (see ‘Fencing the sea’). ‘“No-take’ MPAs ban fishing 
outright, but others permit certain types of fishing in specific zones. 
One of the forces driving the expansion of larger MPAs has been the 
Global Oceans Legacy programme of the Pew Environment Group in 
Washington DC. In the mid-2000s, Pew identified a handful of sites 
as suitable for mega-MPAs because they had relatively low economic 
value and were owned by countries that had a history of conservation 
and the ability to enforce protections. 
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The United States has established MPAs in two of these areas: the 
Papahanaumokuakea protected zone and one in the Mariana Trench 
in the western Pacific. In late November, Australia proposed plans for 
a 990,000-square-kilometre reserve in the Coral Sea off the country’s 
northeast coast. Another of the sites highlighted by Pew was New Zea- 
land’s Kermadec Islands, where protections are currently in place for 
the sea floor, and conservationists would like to extend them to the 
entire water column. 

Jay Nelson, director of the global ocean legacy programme at Pew, sees 
the Papahanaumokuakea MPA as the marine equivalent of Yellowstone 
National Park, the first big parcel of land to be set aside as a reserve. The 
creation of that park in 1872, he says, set offa chain reaction, prompting 
other countries to establish large parks of their own. “That hada similar 
effect to what Papahanaumokuakea has had in the oceans.” 

Countries have picked up the pace of establishing reserves as the dead- 
line approaches for the 2012 target. In 2008, Daniel Pauly, a fisheries 
biologist at the University of British Columbia in Vancouver, Canada, 
estimated that 2.35 million square kilometres of ocean had been pro- 
tected’. In late 2010, a report from the United Nations Environment 
Programme (UNEP) and the International Union for Conservation of 
Nature and Nature Conservancy charities put the area at 4 million square 
kilometres, with the number of MPAs nearing 5,900. 

That area represents just 1.17% of the world’s seas. So the deadline for 
reaching 10% was extended to 2020 at last year’s meeting of countries 
that have signed up to the UN Convention on Biological Diversity. 

The recent spike in protected areas has Pauly feeling hopeful, despite 
the failure to meet the 2012 target. “Maybe vast areas of the ocean will 
be put under protection in time to prevent bad things from happening,’ 
he says. 


RESERVES WORK 
Scientists and conservationists agree that a good MPA can make a 
difference. In 2009, Sarah Lester, a marine ecologist at the University of 
California, Santa Barbara, and her colleagues examined published peer- 
reviewed reports on 124 no-take marine reserves in 29 countries*. They 
found documented increases in characteristics such as biomass, species 
richness and population size within the boundaries of the reserves. 
“By and large, everyone agrees that MPAs can achieve their conser- 
vation goals if they are enforced,’ says Benjamin Halpern, a co-author 
on the study and director of the Center for Marine Assessment and 
Planning at Santa Barbara. 
But many MPAs are worth little more than the paper they are written 


FENCING THE SEA 


Several giant marine reserves have been created in the past decade, including 
the five biggest ones with protection from sea floor to the ocean surface. 
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on because nothing is done to enforce the rules or monitor the area. 
Earlier this year, Peter Jones, an MPA researcher at University College 
London, and his colleagues produced a report' for UNEP analysing the 
governance of 20 MPAs, including those in iconic regions such as the 
Great Barrier Reef and the Galapagos. The team found that none of the 
MPAs rated more than three out of five points for their effectiveness at 
meeting key objectives, such as preserving biodiversity and fostering 
sustainable use of resources. “Three of the 20 MPAs we've said had no 
effectiveness whatsoever — completely paper parks,’ Jones says. 

Of these, Baleie Franca in Brazil and the Cres-LoSinj reserve in Croatia 
scored one. Pirajubaé in Brazil scored zero, in part because the govern- 
ment approved a highway in the area that harmed coastal habitats and 
fishing grounds, the report says. 

Even countries that do try to regulate their MPAs can find it hard to 
fund them. “We don't have the resources that we need to actually moni- 
tor, enforce and understand these areas,’ says Jane Lubchenco, head of 
the US National Oceanic and Atmospheric Administration. 

Poor planning can also hamper an MPA. Mexico, for example, set 
aside parts of the Gulf of California to protect the endangered vaquita 
— Phocoena sinus, a type of porpoise — but did not include a large part 
of the species’ range. 

The Australian government has also been criticized over its efforts at 
marine zoning. In August, a group of 173 scientists published an open 
letter warning that a proposed 320,000-square-kilometre reserve off 
the southwestern coast “fails on the most basic test” by not protecting 
a representative selection of habitats in the region. Some suggest that 
Australia’s proposal for a Coral Sea MPA faces similar issues (see Nature 
480, 14-15;2011). 

Inan e-mail to Nature about the southwestern reserve, a spokesperson 
from the Australian Department of Sustainability, Environment, Water, 
Population and Communities said that the government has a responsi- 
bility to balance social and economic factors along with ecological ones 
in decisions about conservation. “The department does not accept that 
there is a lack of protection for valuable habitat in the proposed marine 
reserves network,’ said the spokesperson. 

Researchers say that there are ways to improve reserve planning. 
Many nations are now setting up MPAs using a software called Marxan, 
which uses an algorithm to calculate the areas that should be included in 
order to protect the most important habitat while minimizing the cost of 
the reserve. One of the driving forces behind the software, Hugh Possin- 
gham, head of the Spatial Ecology Lab at the University of Queensland 
in Brisbane, Australia, says that the best approach basically amounts to 
saying “get me 10% of everything and annoy as few people as possible”. 
Australia used Marxan to design the Great Barrier Reef MPA, which 
sets aside large fractions of some 70 types of habitat. 

Looking forward, researchers want to investigate how best to moni- 
tor and manage the increasing numbers of MPAs. As they assess the 
effectiveness of reserves, scientists sometimes find surprises, such as 
the fate of the monk seals in Hawaii’s Papahanaumokuakea MPA. The 
seal population has increased outside the protected region but declined 
inside it, perhaps because sharks have thrived there*. Ashley McCrea 
Strub, a postdoctoral fellow working with Pauly’s lab, says that there are 
very few data on the cost effectiveness of reserves and it will be vital to 
fill that gap. “That's really the future of where we need to go,’ she says. 
“That's an enormously important question.” 

Some scientists have argued for doing more studies on MPAs before 
moving forward with major plans for new ones. But Possingham says 
the benefits in most cases are clear enough. “If you wait till you have 
perfect data,” he says, “then you'll be waiting forever.” m SEE EDITORIAL P.151 


Daniel Cressey is a reporter with Nature in London. 
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Master of illusion 


Henrik Ehrsson uses mannequins, rubber arms and virtual 
reality to create body illusions, allin the name of neuroscience. 


from your body and then stabbed in the 
chest with a kitchen knife. 

But such experiences are routine in the lab 
of Henrik Ehrsson, a neuroscientist at the 
Karolinska Institute in Stockholm, who uses 
illusions to probe, stretch and displace people's 
sense of self. Today, using little more than a 
video camera, goggles and two sticks, he has 
convinced me that I am floating a few metres 
behind my own body. As I see a knife plung- 
ing towards my virtual chest, I flinch. Two 
electrodes on my fingers record the sweat that 
automatically erupts on my skin, anda nearby 
laptop plots my spiking fear on a graph. 

Out-of-body experiences are just part of 


I: is not every day that you are separated 


BY ED YONG 


Ehrsson’s repertoire. He has convinced people 
that they have swapped bodies with another 
person’, gained a third arm’, shrunk to the size 
of a doll or grown to giant proportions’. The 
storeroom in his lab is stuffed with mannequins 
of various sizes, disembodied dolls’ heads, 
fake hands, cameras, knives and hammers. 
It looks like a serial killer’s basement. “The 
other neuroscientists think we're a little crazy,’ 
Ehrsson admits. 

But Ehrsson’s unorthodox apparatus amount 
to more than cheap trickery. They are part of 
his quest to understand how people come to 
experience a sense of self, located within their 
own bodies. The feeling of body ownership is 
so ingrained that few people ever think about 
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it — and those scientists and philosophers who 
do have assumed that it was unassailable. 

“Descartes said that if there’s something 
you can be certain of in this world, it’s that 
your hand is your hand,” says Ehrsson. Yet 
Ehrsson’s illusions have shown that such cer- 
tainties, built on a lifetime of experience, can 
be disrupted with just ten seconds of visual 
and tactile deception. This surprising malle- 
ability suggests that the brain continuously 
constructs its feeling of body ownership using 
information from the senses — a finding that 
has earned Ehrsson publications in Science 
and other top journals, along with the atten- 
tion of other neuroscientists. 

“A lot of people thought the sense of self 
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was hard-wired, but it’s not at all. It can be 
changed very quickly, and that’s very intrigu- 
ing,’ says Miguel Nicolelis, a neurobiologist at 
Duke University Medical Center in Durham, 
North Carolina. 

Ehrsson’s work also intrigues neuroscientists 
and philosophers because it turns a slippery, 
metaphysical construct — the self — into some- 
thing that scientists can dissect. “We can say if 
we wobble the signals this way, our conscious 
experience wobbles in this way,’ says David 
Eagleman, a neuroscientist who studies percep- 
tion at Baylor College of Medicine in Houston, 
Texas. “That’s a lever we didn't have before.” 

“There are things like selfhood that people 
think cannot be touched by the hard sciences,’ 
says Thomas Metzinger, director of the Theo- 
retical Philosophy Group at Johannes Guten- 
berg University of Mainz, Germany. “They 
are now demonstrably tractable. That’s what I 
think is valuable about Henrik’s contribution” 


DAYDREAM BELIEVER 

Ehrsson was born in 1972 ina suburb of Stock- 
holm. His father was a chemist, his grandfa- 
ther a dentist, and both stoked an interest in 
science and the human body that led him to 
study medicine at the Karolinska Institute. But 
in long anatomy classes, Ehrsson often found 
himself bored. “During lectures, I kept on 
thinking that if my eyes were floating over 
there and I was looking at myself from 
that perspective, where would my 
consciousness be?” After a pause: 
“I was not an A student.” 

After graduating, Ehrsson left 
medicine to start a PhD at the 
Karolinska, using brain scan- 
ners to study how people grasp 
objects. At the same time, he 
was becoming deeply fascinated 
by physical illusions. Some of 
these are well established: Aris- 
totle discovered that crossing 
the index and middle fingers and 
touching the nose can — in some 
people — create a feeling of having 
two noses. Ehrsson also heard about the 
rubber-hand illusion, a trick devised in the 
late 1990s by researchers in the United States. 
They could convince people that a fake hand 
was their own by hiding their real hand under 
a table, placing a rubber one in front of them, 
and stroking both in the same way’. “It really 
worked on me,” Ehrsson says. “It was fantastic 
and surreal.” 

Ehrsson started investigating illusions on 
the side. By the time he completed his postdoc 
at University College London and returned 
to the Karolinska Institute to start his own 
lab, illusions had become his main focus. He 
knew that many scientists use visual illusions 
to glean the basics of perception. “There are 
conferences on visual illusions, and yearly con- 
tests to create the best new ones,’ he says. “But 
there aren't many body illusions. The body has 


never been a main focus of psychology.’ It was 
these tricks, such as the rubber-hand illusion, 
that Ehrsson was keen to explore. He wanted 
to test how easily the sense of body ownership 
could be distorted. 

Ehrsson set about creating more illusions 
based on the same principles as the rubber 
hand one. Headsets, cameras or fake body 
parts fooled the eyes, and synchronous strokes 
and prods added a tactile clincher. In 2007, he 
reported that he had used such props to con- 
vince subjects that they had left their own bod- 
ies’ — a stunt that attracted headlines around 
the world. 

At the time, some scientists and members of 
the public were openly sceptical that the illu- 
sion really worked. But ona trip to Ehrsson’s lab 
this September, I was convinced. The goggles I 
wore displayed the view from a camera point- 
ing at my back (see ‘Out-of-body experience’). 
Ehrsson tapped my chest with one plastic rod 
while using a second one to synchronously 
prod at the camera. I saw and felt my chest 
being prodded at the same time as I saw a pic- 
ture of myself from behind. Within ten seconds, 
I felt as if I was being pulled out of my real body 


“Henrik’s 
work speaks to 
the idea that there 
is no such thing as a 
soul or a self that’s 
independent of 
the brain.” 


and was floating several feet behind it. 

A year after removing his subjects from their 
own bodies, Ehrsson learned how to trick them 
into acquiring new ones. This time, the volun- 
teers’ goggles showed them the view from a 
camera on the head of a mannequin looking 
at its own plastic torso. Simultaneously poking 
the arm or stomach of the mannequin and the 
volunteer a few times was enough to convince 
the subjects that they were the dummy. They 
could even stare at their old bodies from their 
new ones and shake hands with their old self, 
all without breaking the spell’. “It really is very 
intense and incredibly fast; says Mark Hallett, 


FEATURE | NEWS 


a neurologist from the National Institutes of 
Health in Bethesda, Maryland, who experi- 
enced it first hand. 

In his latest trick, published in May” , Ehrs- 
son convinced people that they had jumped 
into a tiny Barbie doll. When he prodded the 
doll’s legs, the volunteers thought they were 
being prodded by giant objects. And when 
Ehrsson tested the illusion on himself and a 
colleague touched his cheek, he says, he looked 
up and “felt as if I was back in my childhood 
and looking at my mother”. 

Not everyone succumbs. Ehrsson sus- 
pects that people who can expertly localize 
their limbs without sight, such as dancers 
or musicians, would be less susceptible than 
the students with whom he normally works. 
But typically, the illusions work for around 
four out of five people. Ehrsson confirms 
the effects by asking his volunteers about 
their experiences and by threatening their 
disembodied, shrunken or plastic forms 
with a knife. If the illusion has worked, then 
volunteers show a reflexive burst of nervous 
sweating (as I did). Even knowing the knife is 
coming is no defence: as Ehrsson puts it, the 
illusions are “cognitively impregnable”. 

Earlier this year, Ehrsson even tweaked the 
rubber-hand illusion and convinced people 
they owned a third hand’. “He has taken 
those basic ideas and investigated how far 

you can push them,” says neuroscientist 
Matthew Botvinick, one of the crea- 
tors of the rubber-hand illusion, 
at Princeton University in New 

Jersey. “He's shown how extreme 

and how malleable the body 

representation can be.” 


SELF DELUSION 
Ehrsson’s next challenge is to 
work out what these illusions 
reveal about the brain. Accord- 
ing to textbook wisdom, people 
build up a perception of their bod- 
ies using ‘proprioception — signals 
from the skin, muscles and joints that 
indicate the relative position of body 
parts. But Ehrsson’s illusions show that 
vision and touch are also a crucial part of the 
mix, and that the brain builds a sense of self 
by constantly compiling information from all 
these senses. Proprioception may be telling 
the brain that the body is seated in a chair, but 
the carefully timed vision and touch signals in 
Ehrsson’s illusion convince the brain that it is 

somewhere else entirely. 

Ehrsson thinks such illusions depend on 
‘multisensory’ neurons, which have been 
studied mostly in monkeys and allow the ani- 
mals to successfully interact with objects by 
combining vision and touch. “We think these 
circuits are important, not just for representing 
external objects,” says Ehrsson, “but represent- 
ing your own body and the boundary between 
your body and the world” He thinks that the 
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OUT-OF-BODY EXPERIENCE 


1. A subject wears goggles showing the view 
from a camera behind him. An experimenter 
prods the subject’s chest at the same time 
as prodding at the camera. 


neurons integrate information from across 
the senses to create a cohesive representation 
of the body. During an illusion, he simply 
changes the data flooding into the neurons to 
manipulate that representation. 

For now, this is a working hypothesis. “The 
details of the multisensory integration have 
never been formally worked out,’ says Botvin- 
ick. “That’s the missing link for me? Ehrsson 
and others are busy trying to see where these 
multisensory neurons reside in the human 
brain, by inducing illusions while volunteers 
sit inside a functional magnetic resonance 
imaging machine. The results differ. Ehrsson 
has found that the ventral premotor cortex 
— known to be involved in visual guidance 
of movements — is particularly active when 
people experience the full body-swap illu- 
sion’. Olaf Blanke at the University of Laus- 
anne in Switzerland, one of only a handful of 
other researchers in this field, has shown that 
the nearby temporoparietal junction lights 
up when people experience out-of-body illu- 
sions’. He points out that brain damage or 
tumours in that area can induce disembodied 
feelings. “It’s difficult to judge which is right, 
because we just have a thin neuroscientific 
layer of data at the moment,’ says Blanke. 


SOUL SEARCHING 

Ehrsson’s interests in the self and body owner- 
ship bleed into the rest of his life. He is drawn 
towards experimental theatre and surrealist 
art. He comes across as thoughtful and remote, 
easily forgetting the names of colleagues and 
often lapsing into silence. He prefers working 
in his lab to going to conferences, he says. 

He also occasionally gets angry letters from 
people who have had out-of-body experiences 
themselves. “They believe that their souls have 
left their bodies, and they feel threatened that 
a similar experience can be induced in a lab” 
says Ehrsson. He offers a diplomatic response, 


2. The subject sees the hand prodding towards the 
camera as he feels his chest being prodded. He also sees 
his body from behind. This creates a vivid sense that his 
real body is floating behind the one he sees. 


saying that he has “no way of disproving their 
ideas”. Metzinger is more forthright. “Henrik’s 
work speaks to the idea that there is no such 
thing as a soul or a self that’s independent of 
the brain,” he says. 

Ehrsson is now working to turn his illusions 
towards a practical goal: developing better arti- 
ficial limbs. Many people who have lost arms 
still feel the presence of their missing limbs, yet 
replacement hands feel alien. “We think that 
people might find it easier to use their pros- 
theses, or develop a better body image, if we 
can elicit the illusion of real ownership,” says 
Ehrsson. 

To achieve this, he has adapted the rubber- 
hand illusion. With amputees, Ehrsson stimu- 
lates small spots on the stump that trigger the 
feeling of phantom fingers. By stroking these 
spots in time with the corresponding parts 
of a robotic ‘cyberhand; Ehrsson and his col- 
leagues have successfully convinced amputees 
that they owned the metal limb’. 

The effect disappeared about 10 to 15 sec- 
onds after the stroking stopped, though, so the 
stimulation would have to be continuous to 
keep the illusion going. That is what Ehrsson 
is working towards now. “The idea is to have 
an advanced prosthetic hand with sensors in 
the fingertips and stimulators in the stump,” 
he says. Although other groups are working on 
similar sensory devices, Ehrsson says that his is 
different because he is attempting to precisely 
match up the feelings from the artificial fingers 
with stimulation in the corresponding part of 
the stump to generate the illusion of ownership. 

Ehrsson has even bigger ambitions. Own- 
ership illusions could help people to take 

control of entire alien 


> NATURE.COM bodies, both virtual and 
Forapodcastonthe robotic, in a way that 
out-of-bodyillusion, would afford a finer 
see: degree of control than 
go.nature.com/Sbhm4g the joysticks and other 
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controllers used to steer robots and avatars 
today. People who control robots, for exam- 
ple, would see from their machine's perspec- 
tive using goggles, steer using motion-capture 
suits and receive tactile feedback from gloves 
connected to sensors in the robot’s hands. As 
long as signals are transferred between human 
and machine within 100 milliseconds, Ehrsson 
predicts that “the full-body illusion will kick 
in”. And size should not matter. A surgeon 
could control a microscopic robot in a patient's 
body. Giant robots could repair broken oil rigs 
or fractured nuclear power stations. Ehrsson 
smiles as he considers the possibilities. 

There is one illusion that even Ehrsson is 
unsure if he can pull off: splitting the sense of 
self, so that a person can commandeer two bod- 
ies. Ehrsson thinks that the trick’s feasibility may 
depend on how the brain integrates information 
from the senses. “Maybe it’s statistical in nature. 
Maybe instead of using absolute coordinates, 
the brain says it’s most likely that my hand is 
here,’ he says. “Ifit works like that, maybe you 
could trick the brain into thinking that this body 
and that body are equally likely to be me.” 

It sounds far-fetched, but so does most of 
what Ehrsson has achieved so far. “We're work- 
ing on it,” he says. “Then again, it might be 
impossible. = 


Ed Yong is a freelance writer based in London 
who writes the blog ‘Not Exactly Rocket 
Science’. 
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China’s unmanned spacecraft Shenzhou-8, launched in November, aims to dock with the Tiangong-1 space lab module launched in September. 


Asia’s space race 


Greater global cooperation is needed to avert the risk of 
further militarization, says James Clay Moltz. 


sian nations are in the midst of an 
Aunrsien space race. Although 

China has captured most of the 
attention, Japan, India, South Korea and 
other countries in the region are also 
expanding their space programmes. Unlike 
Europe, however, where countries are 
cooperating within the European Space 
Agency (ESA), Asian nations are going it 
alone. 

Space activity’s close links to national 
prestige and military operations are part of 
the problem. Like the superpowers of the 
1960s, Asian officials believe that space pro- 
grammes will bring them status at home and 


abroad. Asian nations do not want to be seen 
as ‘backward’ or, worse, as falling behind 
their neighbours in science and technology. 
So they watch one another's space accom- 
plishments closely, and try to match their 
rivals at every turn. 

Tensions in Asian politics underlie today’s 
space competition. Long-festering historical 
and geopolitical feuds have created hostile 
dyads throughout 
Asia: China-India, 


India-Pakistan, North Readmore about 
Korea-South Korea, _ India’sriseto the 
China-Japan and _ Moon: 


Vietnam-China, as 
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well as lesser rivalries in Southeast Asia. 
Unlike Europe and the cold-war-era United 
States and Soviet Union, Asia has no legacy 
of regional security cooperation, much less 
arms control. 

Asia’s emerging space race is spurring 
technological advances in the region. But 
competing national agendas are foster- 
ing scientific duplication, a failure to pool 
resources, political mistrust and increas- 
ing military tensions. These are worrisome 
trends that are also undercutting accom- 
plishments made by the United States, Russia 
and Europe in space cooperation. The big- 
gest fear among military analysts is that 
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> Asia's civilian space race will turn into an 
arms race. Greater cooperation is needed to 
avert an impending confrontation. 

Asia’s rise in space can be tracked in the 
arithmetic of recent launch activity (see 
‘Asia’s space leaders’). Last year, for the first 
time, the number of launches by China 
equalled that of the United States at 15, with 
only Russia ahead of them. In addition, India 
carried out three and Japan two. Major Asian 
countries are likely to increase launch act- 
ivity as they attempt to deploy constellations 
of precision-navigation and timing satellites 
for both civilian and military uses. 

A recent flurry 
of Moon missions 


highlights the Asia’s space 
competitive goals relations cannot 
of Asia’s space bedivorcedfrom 
players. Japan, thebroader 
China and India political climate 


have all conducted 
independent 
lunar-mapping 
programmes (Kaguya, Change and Chan- 
drayaan, respectively) since 2007. All three 
countries are planning follow-up missions, 
including landers, instrumented rovers and 
lunar bases. Despite this scientific redun- 
dancy, no one wants to cooperate and share 
the prestige from these flights. 


in the region.” 


RACE TO THE TOP 

Although science matters, human space- 
flight garners more attention. Witness the 
international tumult following the flight of 
China’s Shenzhou-5 in October 2003. With 
subsequent flights of two and then three 
taikonauts — the latter flight including a 
spacewalk — China has put others in Asia 
on notice. Its September 2011 launch of 
Tiangong-1, the first orbital test module for 
a space station planned for 2020, has raised 
the bar even higher. 

Japan has more experience in human 
spaceflight, with 15 manned missions since 
1992. It is a member of the International 
Space Station (ISS), to which it contributed 
the Kibo research module. Yet all of its 
flights have been on the US space shuttle 
or the Russian Soyuz. Japan’s H-II Transfer 
Vehicle is now bringing cargo to the ISS. It 
could be adapted to carry people, but Japan 
has not yet committed itself to developing 
the necessary life-support and re-entry tech- 
nology. 

India has felt forced to defend its space 
reputation in the face of China’s rise. It has 
recently announced that it will launch its 
own astronauts into orbit by 2016. In prepa- 
ration, it is working with Russia to train 
a cadre of astronauts and to acquire the 
necessary technologies. But this will be an 
expensive and risky undertaking. 

Beyond human spaceflight, all of Asia’s 
major space-faring countries are seeking to 


develop supplements to the US Global Posi- 
tioning System (GPS). China is again in the 
lead, having launched nearly one-third of its 
planned 35-satellite Beidou constellation. 
Like the GPS, Beidou will have both civilian 
and military applications. 

Japan is in the process of building a three- 
satellite GPS-augmentation system called 
Quasi-Zenith, which will reach throughout 
northeast Asia. It is seeking buy-in from 
South Korea to manage costs. 

India is planning a satellite network, called 
the GPS-Aided Geo Augmented Navigation 
System, to provide services in South Asia. 
Although there is some cooperation by these 
countries with the United States, Europe and 
Russia in this field, none is planned among 
the Asian systems. 

Smaller countries are also joining the 
fray. Malaysia piggybacked on its purchase 
of Russian MiG-31 fighter planes to have its 
first astronaut trained by Russia and flown to 
the ISS in October 2007. Its rival Singapore 
is sending engineers and military officers 
to the United States for training to develop 
telecommunications and remote-sensing 
capabilities. Singapore's first domestically 
produced X-Sat microsatellite achieved orbit 
aboard an Indian booster in April. 

Meanwhile, Indonesia has a well-dev- 
eloped satellite-communications network 
built with assistance from US and European 
companies, and in 2007 launched a remote- 
sensing system, built with Germany, aboard 
an Indian Polar Satellite Launch Vehicle 
rocket. Given its favourable equatorial loca- 
tion, Indonesia plans to develop its own 
launch capability by 2014. 

Thailand operates several large com- 
munications satellites covering Southeast 
Asia. It is working with France to develop an 
Earth-imaging satellite and has cooperative 
projects with both China and Japan. 

Vietnam has constructed a system of 
ground stations to support its US-built 
Vinasat-1 communications satellite and is 
developing its own research satellite, called 
Pico Dragon. Backed by Japanese develop- 
ment funding, Vietnam is also building a 
national space centre and an educational 
programme to support it. 

Taiwan keeps tabs on China through a 
unique cooperative arrangement giving it 
access to an advanced Israeli imaging sat- 
ellite as the craft passes through Asia. By 2014, 


ASIA’S SPACE LEADERS 

Country Space budget (yearly US$) 
Japan $3.8 billion 

China $2.2 billion* 

India $1.3 billion 


South Korea $220 million 


Taiwan will have its own domestically built 
satellite, Formosat-5, for reconnaissance and 
Earth imaging. 

The Australian government has recently 
established a new space organization. Its 
2009 Defence White Paper announced a 
major push into space, including plans to 
develop synthetic-aperture radar capabil- 
ity. Australia has purchased entry into the 
US Wideband Global Satcom military- 
communications constellation, and in 2010 
signed an agreement with the United States 
for enhanced cooperation in space tracking. 

On the Korean peninsula, South Korea is 
using Russian Angara boosters in an effort 
to beat rival North Korea into space. In 2009, 
Yi So-yeon, a young female chemist, became 
the country’s first astronaut, visiting the ISS 
aboard a Russian booster. South Korea has 
also developed infrastructure for satellite 
manufacturing and remote sensing. 

Not wanting to lag behind, North Korea 
has twice tried to use its missile programme 
to place a primitive satellite into orbit, both 
times without success (although domestic 
propaganda has trumpeted otherwise). 


RISING TENSIONS 
Although most of Asia’s space participants 
have focused on developing commercial 
space applications and military-support 
programmes such as reconnaissance, com- 
munications and navigation, the region's 
largest militaries are moving into weapons 
applications, causing global concern. 

China’s use of a ground-based inter- 
ceptor to demolish one of its old satellites in 
January 2007 created more than 3,000 pieces 
of hazardous orbital debris, most of which 
will be in orbit until around 2050. Numerous 
spacecraft — including Chinese satellites — 
have already had to move to avoid this debris. 

In response to this perceived anti-satellite 
threat, India has formed the Integrated Space 
Cell to manage its future military space assets, 
including satellites pledged to each of its mil- 
itary branches. The Singh government has 
said that it will develop anti-satellite weapons 
through a direct-ascent, missile-defence pro- 
gramme. This shift from its prior, exclusive 
focus on the civilian Indian Space Research 
Organisation marks a significant change in a 
decades-long peaceful space policy. 

Until recently, all of Japan’s space activi- 
ties had to be ‘civilia’ in nature, according 


Civilian space personnel | Launches per year 


8,300 2S) 
80,000* 10-15 
32,000* aS) 
2,500* 0-1 


*Estimated 
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Above a cloud-covered Earth, Japan’s H-II Transfer Vehicle (left) successfully docked with the International Space Station in September 2009. The following 


year, South Korea’s Naro-1 rocket (right) failed for the second time to launch, crashing just minutes after blast-off in Goheung. 


to a law passed by the National Diet in 1969. 
But China's military space activities and the 
desire to monitor North Korean military dev- 
elopments drove the Diet, in 2008, to legislate 
in favour of military space activities for the 
first time. 

Some senior Japanese officials now state 
that space-based weapons are conceivable 
for the nation, provided they are ‘defensive 
in nature. Japan recently announced a boost 
in its military space spending — for recon- 
naissance and early warning — despite 
ongoing budget woes after the earthquake 
and tsunami in March. 

Ironically, as Asian space tensions have 
mounted, programmes elsewhere have 
grown less competitive. The United States 
and Russia cooperate extensively on the ISS 
and in commercial launch ventures. But no 
such peer cooperation exists among Asia's 
space rivals; instead, their links are skewed. 
Asia’s leading space-faring countries engage 
with the United States, Europe and Russia to 
acquire high-level technology and training, 
but they prefer to work with less-developed 
countries to promote their own leadership, 
economic and security interests. 

For instance, China hosts the Asia-Pacific 
Space Cooperation Organization (APSCO), 
which began operations in 2008. The group 
coordinates joint space-science and civil- 
applications research projects such as the 
Small Multi-mission Satellite, which is used 
for Earth monitoring. 

China hopes that the APSCO will even- 
tually become like ESA, but, to date, its 
membership comprises the less-developed 
countries of Bangladesh, Iran, Mongolia, 
Pakistan, Peru and Thailand. 

Japan operates a parallel, but less-formal, 


organization called the Asia-Pacific Regional 
Space Agency Forum (APRSAF). The for- 
um’s activities include yearly conferences, 
training and cooperative scientific pro- 
grammes such as the Sentinel Asia project 
for regional disaster warning and moni- 
toring. The APRSAF has many regional 
participants — including a few Chinese 
institutes — but it is also a way to promote 
Japan’s regional interests. There are no 
cooperative projects between the APSCO 
and the APRSAF. 


MORE COLLABORATION 

What can be done? Asia’s space relations 
cannot be divorced from the broader poli- 
tical climate in the region. But there are solid 
technical, economic and environmental rea- 
sons for enhanced regional cooperation in 
space. What is lacking is political will. 

Some data exchanges and networking 
have begun for disaster monitoring and 
tsunami warning, given recent natural cat- 
astrophes in Asia. This could be an area for 
enhanced cooperation between the APSCO 
and the APRSAE Lunar research is another 
promising area. China, Japan and India are 
participants in the NASA-hosted Global 
Exploration Strategy and have expressed 
interest in joining the data exchanges 
planned under the International Lunar 
Network; a joint mission or hosted payload 
relationship could serve to reduce tensions 
and foster stronger linkages. 

Further forward, a human spaceflight 
initiative by Japan or China — modelled 
on the US-Soviet Apollo-Soyuz docking 
in 1975 — could help to foster operational 
cooperation and personnel exchanges. Such 
contacts would promote transparency and 


personal connections among scientists. 

Most important is the reduction of military 
tensions. At present, the major players lack 
even a basic satellite non-interference pledge 
like the one that was built into US-Soviet 
relations in 1972. A regional initiative on this 
score could help to reverse today’s hostile 
trends. Similarly, the threat of orbital debris 
could provide the basis for talks on halting 
mutually damaging kinetic- and laser-weap- 
ons tests. 

As an experienced and interested obser- 
ver, the United States might have a part to 
play. Increased efforts to implement the 
US government's 2010 call for ‘responsible 
behaviour’ in space could bea starting point, 
perhaps with the United States hosting a dia- 
logue among major space-faring nations. 
Although the Obama administration faces 
opposition in the US Congress, initiatives 
with China in lunar research, human 
spaceflight or space-traffic management 
could be used to draw the focus of reluctant 
Asian actors away from regional rivalry and 
towards shared, ‘humankind’ interests and 
safer norms of conduct in space. 

The present course of Asian space activity 
risks a future collision, both literally and 
figuratively. Unfortunately, absent a disaster 
that affects all players or bold new leadership 
to promote cooperation, the situation seems 
likely to get worse before it gets better. m 
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Aspects of gaming cultures, such as players’ conventions, are unfamiliar to many members of the ethics committees that must judge web-based privacy issues. 


Open up online research 


Social media hold a treasure trove of information. But the secretive methods of ethics 
review boards are hindering their analysis, says Alexander Halavais. 


r | Vhe mass adoption of networked 
communication and an emerging 
culture of open sharing have pro- 

vided a boon for social scientists. They 

have opened a window through which 
social scientists can observe the experi- 
ences of individuals and groups that have 
been all-too-easily ignored by history, and 
present an opportunity for researchers to 
examine not just a few, but thousands of 
eyewitness accounts of an event. My own 
work, for example, has examined pro- 
tests on Twitter, how people use the Digg 
website to become part of an online com- 
munity, how political campaigners reach 
voters online and how blogs can contrib- 
ute to the spread of news. The US Library 
of Congress, which last year acquired the 
archive of all public tweets made since 
Twitter began in 2006, announced that 
it will open up access to the database, 
although it is not yet clear when or under 
what rules. Through social media, social 
science is entering an age of ‘big science’ 
Collecting public musings can create 
ethical dilemmas for researchers. The 
messages may be publicly available, but the 
individuals writing them might not realize 
just how public they are, and might be sur- 
prised when their words are preserved or 


placed in another context. When, in 2006, 
sociologists from Harvard University in 
Cambridge, Massachusetts, collected data 
from Facebook for a study on students’ 
friendships and shared interests, some 
people questioned whether the research- 
ers had sufficient permission to distrib- 
ute the data, and whether they had made 
them sufficiently anonymous. Similarly, 
when America Online (AOL) released a 
data set of some 20 million search terms 
from 650,000 anonymized users for the 
purposes of research, also in 2006, jour- 
nalists and others managed to link some 
individuals to strings of search terms. The 
scandal resulted in a class-action lawsuit 
and the resignation of AOL’ chief technol- 
ogy officer. 

Working on large-scale public conversa- 
tions is new ground for many researchers, 
and for most research ethics committees, 
also called institutional review boards 
(IRBs). What constitutes ethical conduct 
remains blurry. In the case of the Harvard 
Facebook study, the methodology passed 
ethics review, but still caused controversy. 
Equally troubling is the other side of the 
coin: many studies have been blocked by eth- 
ics review, even when they present minimal 
risk to the participants. 
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IRBs were initially created as a way to 
review medical research in the United 
States, protect against ethical misconduct 
and ensure that participants were made 
aware of the risks associated with medi- 
cal experimentation. By the late 1960s, the 
boards were also overseeing privacy risks 
in social-science research, prompting the 
prominent US anthropologist Margaret 
Mead to argue to the National Institutes 
of Health that her field did not work with 
“subjects”, but rather with “informants in an 
atmosphere of trust and mutual respect”. By 
the mid-1990s, particularly in the United 
States, IRBs were overseeing groups that 
they used to ignore: historians, journal- 
ists and folklorists, for example. As Laurie 
Essig, a sociologist at Middlebury College 
in Vermont, put it in a blog post in August 
in the Chronicle of Higher Education: “IRBs 
have treated speaking with someone as 
equivalent to experimenting on them and 
have almost killed fieldwork in the pro- 
cess”. The US model is becoming the global 
norm, as IRBs elsewhere begin to review 
studies being done in the social sciences 
and humanities. 

The time and expense of intensive eth- 
ics review of online social science puts the 
brakes on such work: both slowing down 
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research and restricting the sharing of data. 

Steps are being taken to resolve these 
issues, but more needs to be done. Journals 
and funding agencies can, and should, help 
by making ethics reviews more transparent. 
This will demystify the ethics considera- 
tions behind such work so that researchers 
can learn from the successes of others. It 
will also encourage companies and the pub- 
lic to entrust their personal information to 
scientists. IRBs, for their part, should not 
apply the same standards used for studies 
that entail risks to health to those that may 
just have privacy issues, particularly when 
the societal perception of online privacy is 
in flux. 


WAR STORIES 

Working social scientists will agree that 
research conduct must be ethical, but few 
will extol the virtues of the IRB. War sto- 
ries abound: a protocol held up because 
someone on the committee felt that the 
area of research was fruitless, or because 
of a spelling error, or because the research 
might have brought too much controversy 
to the campus. Even when the issues are 
more central to the protection of people, 
the standards of approval are often ambigu- 
ous and informal. High levels of scrutiny 
are clearly necessary for a drug trial. But 
scrutinizing whether gamers would be 
traumatized by being asked questions about 
dressing up as characters for conventions 
— to take an example from my students’ 
research — is an issue best addressed by 
the researchers, who have had much more 
exposure to the participants and the culture 
being examined. 

For those who research online interac- 
tions, it can be especially frustrating to have 
aboard filled with members who have never 
used Facebook or played World of Warcraft. 
Although IRBs can, and sometimes do, bring 
in experts who can address the context of the 
research more directly, this happens more 
rarely than it should. 

Members of IRBs tend to be more 
comfortable with some methods than oth- 
ers: the hypothesis-driven experiments of 
the psychology lab rather than the inductive 
work done by ethnographers, for example. 
The decisions of IRBs seem to be idiosyn- 
cratic and, by extension, capricious, espe- 
cially when multi-site research is approved 
by several boards, yet held up by others. In 
the case of a colleague, each of two review 
boards insisted on having the other approve a 
protocol first. This can easily lead to research 
gridlock, and has spawned a growing indus- 
try of professional ethics-review expediters. 

Calls have continually been made to 
improve the ethics review process. The 
Institutional Review Blog (www.institution- 
alreviewblog.com) chronicles the overzeal- 
ous restrictions of IRBs in the humanities 


and social sciences, for example. The largest 
organization of ethics review profession- 
als, Public Responsibility in Medicine and 
Research, provides a venue for discussing 
ways to improve the process. Perhaps most 
promisingly, in July, the federal agency that 
oversees IRBs in the United States — the 
Office for Human Research Protections — 
announced its intention to revise the rules 
for research that poses only a minimal risk to 
people. The proposed changes should reduce 
many of the burdens currently imposed on 
researching open discourse on the web — 
such as studying blog posts or tweets. It is, 
however, unlikely to make standards of pri- 
vacy clearer to researchers, students or those 
who host participatory websites. 

After researchers have battled their way 
through the IRB process, the result, an IRB 
approval form, is usually tucked away in a 
drawer. Students are often introduced to this 
secretive process in a cursory overview of 
ethics in a methods course, and sometimes 
by being assigned to handle the IRB process 
for a project. Both good and bad models 
of ethical research are difficult to come by 
for students, particularly when the work is 
treading less trammelled ground: ethno- 
graphic research of virtual communities, 
for example. 


FIXING THE SYSTEM 

The solution is not to do away with the IRB, 
but rather to make amendments that render 
its dysfunctions less acute. 

The first step is to agree on a reasonable 
threshold for when IRBs should become 
involved — although federal oversight bod- 
ies set a minimum standard, institutions 
have some discretion on their specific poli- 
cies. Some universities, in an overabundance 
of caution, require work that incurs even the 

most minimal risk 


“The decisions to be reviewed by 
of IRBs seem to an ethics commit- 
be idiosyncratic tee before it can 
and, by commence, often 
extension, leading to months 


of delays. And not 
all committees con- 
sider research that 
involves text — such as tweets or blog entries 
— to be in need of ethics oversight. This lack 
of clarity and consensus results in unneces- 
sary oversight, taxing boards that could spend 
their time better on work that presents a sig- 
nificant risk to vulnerable people. When risks 
are to adults’ privacy only, for example, an IRB 
should not need to review the protocols. 

A complementary important step is to 
ensure that upcoming students, in all fields, 
have adequate ethics training. This will help 
to make certain that research not requiring 
IRB approval still has ethical thought behind 
it, and discourage the notion that handing a 
proposal over to an IRB excuses researchers 


capricious.” 


from considering the ethics themselves. 
Ethics thinking should begin when a study is 
first being designed and continue through to 
its completion and publication, rather than 
constituting a single bureaucratic hurdle. 

The greatest problem faced by the ethics 
system is secrecy. Review boards must make 
decisions with limited access to previous 
cases. Their decisions are rarely available to 
other IRBs or to researchers who could make 
productive use of their precedents — partic- 
ularly in new areas of research such as online 
social science, in which review boards tend 
to have little experience and would therefore 
benefit most from the experience of others. 
One solution would be to require IRBs to be 
transparent in their decision-making. This, 
however, seems unlikely to succeed. The IRB 
system is conservative by design; we must 
look elsewhere for change. 

Most government funding agencies and 
private foundations prominently promote 
open data sharing and collaboration. Were 
they to make the open publication of IRB 
protocols or ethics reflections a requirement 
for receiving funding, it would provide a 
crack in an otherwise too-secretive pro- 
cess. Many journals expect social-science 
research to have been inspected by an 
IRB before submission; a few even require 
authors to sign a statement of IRB approval. 
None requires that the approved protocol be 
provided to the journal or published. They 
should. Although some information would 
need to be redacted from such documents 
— including anything that might violate the 
privacy of participants or of the researchers 
— they would open a new window on ethics 
considerations. 

Enacting these changes would not only 
grease the wheels of social-science research, 
but also help to convince the public that the 
work is both important to their well-being 
and being done in a trustworthy way. That, 
in turn, might make companies more will- 
ing to share their data with science. At pre- 
sent, researchers find it difficult to study the 
large data sets from Facebook or Twitter, for 
example, without partnering up with some- 
one within those companies; the sites’ terms 
of use prevent outsiders from ‘scraping’ large 
data sets. 

By moving the consideration of ethical 
conduct beyond the localized IRB to the 
wider research community, we can evolve 
the kinds of standards and best practices that 
can serve to instruct not just the scholarly 
world, but also the wider realms of govern- 
ment policy and corporate practice. m 
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The sense of smell is essential, coming into play in everything from personal identity to assessing the quality of foods such as this pecorino cheese. 


Scent and sensibility 


Chris Loss savours a wide-ranging exploration of flavour that takes in evolution and 
physiology — and suggests it could be key to a healthier future. 


r | he most important apparatus in 
the culinary arts is not cutting- 
edge kitchen equipment or exotic 

ingredients; it is the senses. As neuro- 

scientist Gordon Shepherd describes in 

Neurogastronomy, it’s through the senses 

that we translate the size and configuration 

of volatile molecules into qualitative mean- 
ing, from biochemical into flavourful. 

In this succinct yet informative book, Shep- 
herd distills more than 50 years of research 
— and the collective knowledge of scien- 
tists, gastronomes, chefs, anthropologists 
and artists — to improve our understanding 
of flavour. He describes the physiology and 
neuro-circuitry of eating in compelling detail, 
delving deeply into the brain’s systems for fla- 
vour detection. After introducing the science, 
he provides snapshots of seminal academics 
and culinary figures past and present. 

Reading Neurogastronomy has convinced 
me that the answer to the question of why we 
are here is ‘flavour’ After all, the largest gene 
family in mammals codes for the primary 
determinants of flavour: the olfactory recep- 
tors. These genes comprise around 3% of the 
whole human genome — a greater percent- 
age than that of immunoglobulin and T-cell 


fs — receptors combined. 
NEUROGASTRONOMy Shepherd starts 
| a with the basic science 
ec of smell and taste. 
hee rr Smell, he explains, is a 
| hé E, dual sense: orthonasal 


Geen and retronasal. When 
a hey | you take a sniff of 
_ 4 something, such as by 
poking your nose into 
a glass of wine, that is 
orthonasal smelling. 
Retronasal smell- 
ing is more involved. 
When you swallow 
food or drink, you 
close your mouth and 
exhale through your 
nose. A puff of air is then pumped past the 
olfactory epithelium in the upper nasal cav- 
ity, where the olfactory receptors are embed- 
ded. Volatile chemicals in the air ‘tickle’ the 
receptors, which generate a signal in the 
olfactory bulb. This is converted into an 
‘odour image’ that the brain recognizes as 
an aroma. As Shepherd explains, retronasal 
smell has the bigger role in flavour. 
We learn that flavour is a multi-modal 


Neurogastronomy: 
How the Brain 
Creates Flavor and 
Why It Matters 
GORDON M. 
SHEPHERD 

Columbia University 
Press: 2011. 288 pp. 
$24.95, £16.95 


176 | NATURE | VOL 480 | 8 DECEMBER 2011 


© 2011 Macmillan Publishers Limited. All rights reserved 


perception: other senses come into play too. 
Mouth-feel — sensing the texture of foods 
through receptors inside the cheek, palate and 
tongue — can range from springy, chunky 
and stiff to gritty, astringent and painful 
(burning, sticking or aching). Hearing and 
sound are also important in relation to food 
texture and enjoyment of flavours: studies 
have found that crunching raw carrots pro- 
duces a pitch of 1-2 kilohertz, whereas eating 
crispy flatbread produces more than 5 kHz. 
We generally prefer “clearer and louder” food 
sounds, says Shepherd, who speculates that 
we interpret them using the brain’s neocortex. 

Beyond the immediate thrill of tasting 
chocolate, Thai curry or a red Bordeaux, 
flavour is an intimate part of ourselves, says 
Shepherd. Through it we connect to our 
environment, learn about our individuality 
and even gain insight into the origins of the 
human condition. 

The book quotes anthropologist Richard 
Wrangham, who states that the advent of 
cooked food was key 


to human evolution: ONATURE.COM 
it required less energy  Formoreonthe 
to chewand increased _ science of scent: 


the bioavailability of — go.iature.com/z/Gmvli 
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nutrients. The energy used to physically and 
chemically break down food could instead be 
diverted towards brain development, he says. 
Shepherd posits that our retronasal olfac- 
tory apparatus — and the flavours detected 
through it — allowed for detection and iden- 
tification of a broader diversity of nutrients. 
This process, he says, ultimately led to the 
establishment of cuisines, which, apart from 
their national and cultural connotations, are 
one of the key ways in which humans success- 
fully adapt to their environments. 

Understanding that flavour is largely a 
creation of the brain, Shepherd argues, is cru- 
cial to addressing food-related health issues 
facing society today, such as lowering caloric 
intake and reducing sodium content without 
compromising flavour. When the nutritional 
sciences are applied in the food industry, the 
focus is often on the effects of including or 
excluding certain nutrients for health. But 
with improvements in nutritive heft must 
come improvements in flavour. The emerg- 
ing field of neurogastronomy recognizes that 
requirement, and Shepherd is just one voice 
in a growing chorus calling for more col- 
laboration among flavour experts, including 
chefs, neuroscientists and biochemists. 

The book highlights key culinary figures 
past and present. Notably, the nineteenth- 
century French epicure and politician Jean 
Anthelme Brillat-Savarin, author of The 
Physiology of Taste (1825), pops up through- 
out to remind us that what we eat fosters criti- 
cal evaluation of the natural world. Food and 
its flavours were the main point of inquiry 
for Savarin, and he pioneered some of Shep- 
herd’s major themes, basic concepts and 
underlying mechanisms. Along with modern 
gastronomic heroes such as French chef Paul 
Bocuse, we encounter US food-science writer 
Harold McGee, who has inspired and guided 
chefs towards more evidence-based cuisine, 
and has investigated the role of flavour chem- 
istry in the science underlying deliciousness. 

Shepherd ends with an adept deconstruc- 
tion of the famous sensory journey of Marcel 
Proust in his book Remembrance of Things 
Past — specifically, the tisane of lime-flower 
tea and the madeleine cake that sets Proust 
off on his epic memory-fest and reveals the 
hard-wired connectivity of flavour stimuli to 
memory, emotion and critical thought. 

If flavour is the trigger for an act of the 
imagination as extraordinary as Proust's mag- 
num opus, what else might it do? Shepherd 
makes an excellent case for neurogastronomy 
as an important cross-disciplinary field that is 
likely to motivate a variety of imperatives for 
our health and well-being. m 


Chris Loss is a food scientist and director 
of menu research and development at The 
Culinary Institute of America, Hyde Park, 
New York 12538, USA. 
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Books in brief 


The Life of Super-Earths: How the Hunt for Alien Worlds and 
Artificial Cells Will Revolutionize Life on Our Planet 

Dimitar Sasselov BASIC Books 240 pp. $24.95 (2012) 

Planet hunters have been busy bagging their quarries since 1995: 
known extrasolar planets now number around 600. In this short, 
sharp look at the subset called ‘super-Earths’ — rocky or oceanic, 
but more massive than Earth — astronomer Dimitar Sasselov gives 
us the science and the speculation about life on other worlds. He 
suggests that the Copernican revolution, which demoted Earth from 
its position at the centre of everything, could be brought full circle by 
new findings from synthetic biology and planetary science. 


Babel No More: The Search for the World’s Most Extraordinary 
Language Learners 

Michael Erard FREE PRESS 320 pp. $25.99 (2012) 

Polyglottism has always amazed. But are hyperpolyglots — who 
speak many languages — neurological oddities, swots or genetically 
predisposed? Writer Michael Erard’s tale of tongues includes such 
figures as Bolognese priest Giuseppe Mezzofanti, alleged master of 
AO languages, and British explorer Richard Burton, who spoke 29 
(and 11 dialects). Erard examines sections of multilinguists’ brains 
and tracks down today’s hyperpolyglots. He concludes that they are 
a “neural tribe” — ambitious and willing to reshape their brains. 


Golden Holocaust: Origins of the Cigarette Catastrophe and the 
Case for Abolition 

Robert N. Proctor UNIVERSITY OF CALIFORNIA PRESS 779 pp. $49.95 (2012) 
A century of hype has made tobacco the most popular drug on 
Earth. Although smokers may feel they are working the glamour 

of film noir, the facts about this lethal habit just get grimmer. 

For his monumental and sobering indictment, science historian 
Robert Proctor dug through piles of recently released industry 
documentation to uncover the activities that lured many scientists 
into its mill of denial. A tale of giant profits, decades of secrecy over 
the links with cancer, useless filters and more. 


The Sacred Headwaters: The Fight to Save the Stikine, Skeena, 
and Nass 

Wade Davis and Carr Clifton GREYSTONE Books 160 pp. $45 (2012) 
Splayed next to southern Alaska, Canada’s Sacred Headwaters 
region is a vast panorama of mountains, salmon rivers and canyons 
criss-crossed with the trails of caribou, grizzlies and mountain goats. 
Thousands of First Nations people live there. But as anthropologist 
and ethnobotanist Wade Davis explains, it could become a war zone. 
Corporations are queuing up to develop the region, halted only by 
First Nations activists and a government moratorium that lasts until 
2013. Carr Clifton’s haunting photographs evoke what’s at stake. 


The End of Illness 

David B. Agus SIMON & SCHUSTER 352 pp. £14.99 (2012) 

The billions of dollars spent on medical research have failed to 
vanquish cancer and other serious diseases. Solutions, argues 
medical oncologist David Agus, depend on a systemic, whole-body 
approach rather than a reductionist one. Emphasizing prevention and 
tailored treatment, Agus advocates the combination of techniques 
such as proteomics with pragmatic routines based on robust 
research. Avoiding risky habits and introducing lifestyle changes such 
as taking aspirin are small steps that could pay dividends, he says. 
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nutrients. The energy used to physically and 
chemically break down food could instead be 
diverted towards brain development, he says. 
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its flavours were the main point of inquiry 
for Savarin, and he pioneered some of Shep- 
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gastronomic heroes such as French chef Paul 
Bocuse, we encounter US food-science writer 
Harold McGee, who has inspired and guided 
chefs towards more evidence-based cuisine, 
and has investigated the role of flavour chem- 
istry in the science underlying deliciousness. 

Shepherd ends with an adept deconstruc- 
tion of the famous sensory journey of Marcel 
Proust in his book Remembrance of Things 
Past — specifically, the tisane of lime-flower 
tea and the madeleine cake that sets Proust 
off on his epic memory-fest and reveals the 
hard-wired connectivity of flavour stimuli to 
memory, emotion and critical thought. 

If flavour is the trigger for an act of the 
imagination as extraordinary as Proust's mag- 
num opus, what else might it do? Shepherd 
makes an excellent case for neurogastronomy 
as an important cross-disciplinary field that is 
likely to motivate a variety of imperatives for 
our health and well-being. m 
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research. Avoiding risky habits and introducing lifestyle changes such 
as taking aspirin are small steps that could pay dividends, he says. 
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PHILOSOPHY 


Meditation in mind 


Daniel Stoljar contemplates two contrasting takes on 


science and Buddhism. 


uddhism is a distinctive world reli- 
Bee: It lacks an ‘omnigod’ — an 

omnipresent, omniscient, all-powerful 
creator — anda notion of humans as com- 
plexes of physical bodies and souls that 
ascend to heaven after death. Could it be min- 
gled with the scientific culture of the twenty- 
first century to produce a new philosophical 
outlook on the world, the mind and our val- 
ues? That idea lies at the heart of these two 
contrasting books on Buddhism and science. 

In The Bodhisattva’s Brain, philosopher of 
mind Owen Flanagan wants to change Bud- 
dhism to fit better with the scientific world 
view. In Meditations of a Buddhist Skeptic, 
Buddhist scholar and monk B. Alan Wal- 
lace wants to alter the scientific world view 
so that it meshes better with Buddhism. Both 
tease out the contrasts between Buddhism 
and materialism — and both stumble when 
it comes to defining the latter. 

For Flanagan, the world is fundamen- 
tally physical, and thus explicable by natural 
science, at least in principle. But Buddhism 
seems inconsistent with this materialistic 
world view. It may have no place for a crea- 
tor god or ascending souls, but as Flanagan 
notes, it is “opulently polytheistic insofar as 
spirits, protector deities, ghosts, and evil spir- 
its abound”. He points out, too, that in east 
and southeast Asia, a belief in rebirth among 
Buddhists is as common as a beliefin heaven 


The Bodhisattva’s Brain: Buddhism 
Naturalized 

OWEN FLANAGAN 

MIT Press: 2011. 272 pp. $27.95, £19.95 


Meditations of a Buddhist Skeptic: A 
Manifesto for the Mind Sciences and 
Contemplative Practice 

B. ALAN WALLACE 

Columbia University Press: 2011. 304 pp. 
$27.95, £18.95 


among North Americans. So Flanagan sets 
out to ‘naturalize Buddhism: to see what 
Buddhism would look like without the “hocus 
pocus” (as he cheerfully puts it). 

The result is a wide-ranging discussion of 
the neural and cognitive basis of mental states 
such as meditation and the achievement of 
enlightenment or nirvana, which are central 
to Buddhism. Flanagan outlines a plausible 
moral philosophy based on an idea that he 
takes from Aristotle but reinterprets in the 
light of Buddhist teaching: eudaemonia, a 
sort of happiness that, in Flanagan’s view, is 
the proper aim ofa good life. 

Wallace adopts a contrary view, even urg- 
ing the many scientists and philosophers 
who embrace materialism to change their 
minds. His bookis in part a compelling and 
clear statement of key Buddhist ideas, but 
its main point is to advocate a distinction 
between science and a materialist inter- 
pretation of it. Materialist science, Wallace 


thinks, cannot get to grips with the reality 
of consciousness, free will or values. Science 
inspired by Buddhist experience might. 

In both books, however, the concept of 
materialism remains blurry. Wallace notes 
what materialism isn’t: the ancient ‘atoms 
and the void’ notion of the Greek philosopher 
Democritus, which is inconsistent with mod- 
ern physics. But rather than clarifying what 
it is, he uses materialism as a placeholder for 
his dislikes — variously, a nihilistic rejection 
of moral values, a desire for more possessions 
and the denial of consciousness and the mind 
altogether. A more explicit statement from 
Wallace about what he is rejecting would have 
given his manifesto more muscle. 

Flanagan's discussion, in my view, has a 
similar flaw. In contemporary philosophy of 
the mind and science, materialism is some- 
times seen as the view that neuroscience, 
particularly at the cellular level, can explain 
consciousness. But Flanagan instead advo- 
cates what he calls “the natural method” for 
this task. Roughly, this research approach 
involves gathering subjective data, such 
as first-person narratives, and then using 
ideas or theories from cognitive and neural 
science to explain what is going on in the 
brains or minds of the conscious person. If 
that is materialism, it is hard to see anybody 
disagreeing with it, Buddhist or not. 

All this said, it is true that science has yet to 
produce good explanations of consciousness, 
value and free will. The suggestion brought 
to the fore by Flanagan and Wallace — that 
Buddhism may bea source of insight in these 
areas — is a welcome and tantalizing one. m 


Daniel Stoljar is professor of philosophy 

at the Australian National University, 
Canberra, ACT 0200, Australia, and author 
of Physicalism. 

e-mail: daniel.stoljar@anu.edu.au 


Buddhism, which reached Borobudur in Java, Indonesia, by at least the ninth century AD, could provide insight into consciousness and free will. 


178 | NATURE | VOL 480 | 8 DECEMBER 2011 


© 2011 Macmillan Publishers Limited. All rights reserved 


W. MARR/PANORAMIC IMAGES/ROBERT HARDING PICTURE LIBRARY 


M. ZIENKEWICZ 


Q&A Roald Hoffmann 
Chemical connector 


Theoretical chemist, poet and playwright Roald Hoffmann won a Nobel prize in 1981 for his 
work on how molecules change as they react. As the International Year of Chemistry comes to a 
close and he releases two books, Hoffmann talks about language, ethics and the sublime. 


Why edit a volume of essays about the 
sublime? 

Beyond the Finite is a collection by various 
authors writing on the role of the sublime 
in art and science, co-edited with Iain Boyd 
White. Most scientists think that the sublime 
is just emotional excess or, God forbid, the 
religious creeping into what we do. This is 
silly. Scientists are confronted with astound- 
ing novelty. Why not embrace the wonder one 
feels when looking at the ribosome, or at the 
beauty of a molecule shaped like a Ferris wheel 
or at a distant dust cloud across a galaxy? Why 
not accept that point at which one feels one 
really knows something that is deep and uni- 
versal? Why not be at peace with the sublime? 


What topics do you cover in the other book? 
Roald Hoffmann on the Philosophy, Art and 
Science of Chemistry, edited by Jeffrey Kovac 
and Michael Weisberg, collects my writings 
on a number of subjects. Among these are 
what goes on under the surface of an article 
in a chemistry journal, how the philosophy 
of science would be different if it was done by 
chemists, and strategies for improving teach- 
ing. I talk of the special relationship between 
handicrafts and chemistry. And I fight reduc- 
tionism all the way. 


When did you become a writer? 

I leapt into poetry from science at 40, then 
filled in the ground in between. I started 
writing essays and philosophical articles at 


Beyond the Finite: The Sublime in Art and 
Science 

EDITED BY ROALD HOFFMANN AND IAIN BOYD WHITE 
Oxford University Press: 2011. 208 pp. $24.95, £15.99 


Roald Hoffmann on the Philosophy, Art, and 
Science of Chemistry 

EDITED BY JEFFREY KOVAC AND MICHAEL WEISBERG 
Oxford University Press: 2012. 448 pp. $35, £22.50 


around 50, and plays after 60. My years of 
getting scientific articles rejected were good 
training for submitting poems. It took me 
seven years to get a poem accepted, and I 
still have difficulty getting them published. 


What do you research? 

Early in my career, I found a method to 
explain the ways in which electrons move 
in molecules. As a theorist, I have moved 
through chemistry and am now focusing on 
solid-state physics and conductors — and 
building bridges between these subfields. I 
have explained the shape, colour, spectra, 
electrical properties and reactivity of mol- 
ecules using simple ideas about how elec- 
trons move in their orbitals, and I have tried 
to predict molecules that haven't yet been 
made. Chemists will tell you that Iam good 
at devising verbal explanations and extract- 
ing understanding out of poor calculations. 


You call your work “applied theoretical 
chemistry”. What does that mean? 
Tm a theorist, but I take my inspiration from 
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experiment. I don’t build castles in the sky. 
I don't believe in working on ‘important’ 
problems. I would rather solve the many 
little puzzles that I see in the literature — 
strange reactivities, or why one molecule is 
bent when another is straight. Underneath 
is my philosophy that if I look at hundreds 
of smaller problems in chemistry, and keep 
in mind the connections that must be there, 
I will see the chemical universe. 


Should chemists take the blame for how the 
molecules they have created are being used? 
We should not take the blame. But as we 
create the new, we should ask whether our 
creation — be it a molecule, a gun or apoem 
— can hurt people. Without that question, 
the act of creation is incomplete. My play 
Should've tries to deal with this question. 
What responsibility do we have when, despite 
our best intentions, something we made that 
appears harmless, or even good, is abused by 
others? Not much, but some. Here we enter 
the realm of Greek tragedy. This is not to set 
scientists on a pedestal, but to make them 
see that they are no more absolved of ethical 
choices than other people. 


Your play Oxygen was staged across Europe 
this autumn. What does it portray? 

Written with chemist and author Carl 
Djerassi, Oxygen comes out of the story of 
one week in 1774 when Antoine Lavoisier 
was on the verge of understanding modern 
chemistry. All he needed was the discovery 
of oxygen. Suddenly he is faced with Joseph 
Priestley and Carl Wilhelm Scheele, who 
have each discovered the element but don't 
understand what they have discovered. How 
does Lavoisier react to that? He ignores 
Scheele and tries to take credit away from 
Priestley. Then there is Madame Lavoisier, 
who deserves her own opera. So, who dis- 
covered oxygen? We have these romantic 
notions of the past, but there were disputes 
over credit then, just as there are now. One 
has to admire the Nobel committee for hav- 
ing the moral strength to make its decisions. 


What keeps you interested in chemistry? 
Chemistry is closer to the human scale than 
physics. Neither subatomic particles nor the 
shape of the Universe will affect you much. 
But if you are prescribed a drug, you want it 
to work. Molecules are like humans in some 
ways: they can hurt or heal us. They are con- 
stantly moving, both natural and unnatural, 
wondrous in their complexity. A journalist 
once asked me when we will discover the 
next element. I told him that it is not about 
new atoms, but rather the molecules that a 
chemist can build from them. As with toy 
blocks, it is what the child’s imagination does 
with them that counts. m 


INTERVIEW BY JASCHA HOFFMAN 
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Correspondence 


Violence: the role of 
society and state 


I question Steven Pinker’s 
optimistic prediction of 

a continuing decline in 
interpersonal violence, which is 
based on supposed correlations 
of mortality data with improved 
reasoning ability and IQ (Nature 
478, 309-311; 2011). 

Even if we disregard possible 
biases in the mortality data and 
the problems of quantifying 
reason and IQ over long time 
spans or across cultures, 

a sudden shift in innate 
individual attributes since 
the mid-twentieth century is 
unlikely. Pinker’s contention 
that “modern societies have 
been getting smarter” makes 
a further dubious leap from 
an individual to a social 
intelligence. 

More-concrete social features 
should be considered. The 
societies in which the decline of 
interpersonal violence is most 
marked have become more 
organized and affluent, with 
less contention over resources, 
and violence has mostly been 
deferred to the state. We know 
that this deferral can lead to 
violent events that are more 
erratic and severe — and even 
ideological and irrational. 

Primitive terrorism and 
revenge can be blamed for tens 
of thousands of casualties in 
Iraq and Afghanistan. We have 
had the nuclear technology 
of violence since the Second 
World War. Occasions on 
which nuclear weapons were 
actually used (with some 
200,000 civilian casualties), or 
were close to being used (in 
the Korean and Vietnam wars 
and in Cuba), indicate that 
the statistics for mortality by 
violence are unstable owing to 
the escalation and sensitivity of 
trigger points. 

Rather than an irrational 
optimism, we should adopt 
a rational, risk-assessment 
approach that better recognizes 
the instability of organized and 
powerful social violence. 


Gary G. Nelson Troy, New York, 
USA. garygnelson@gmail.com 


Violence: intuition is 
important too 


Steven Pinker argues that 
humans have been short of 
rationality throughout most 
of history, and suggests we 
may now be putting that right 
(Nature 478, 309-311; 2011). 
But intuition — the gut feelings 
that range from raw instinct to 
common sense — is important 
too. Reasoning and intuition 
together lead to wisdom, but 
either without the other is 
dangerous. 

Intuition derives from 
various subconscious responses, 
presumably largely evolved, that 
lie at the root of our moral and 
aesthetic sense. A healthy mind 
conducts a perpetual dialogue 
between reasoning and intuition. 

Rationality without moral 
and aesthetic constraint has led 
us to nuclear war and eugenics 
and, more recently, to the 
crude logic of neoliberalism, 
industrialized agriculture and 
overconsumption. 

This, of course, is where 
religion comes in. Although it 
sometimes loses sight of its own 
raison détre, its core function is to 
refine our intuitions. 

Colin Tudge Oxford, UK. 
colin@colintudge.co.uk 


Europe must address 
research misconduct 


In the United States, the 
Office of Research Integrity 
and the National Science 
Foundation both impose legal 
sanctions for scientific fraud 
in publicly funded research. 
The European Commission 
has no equivalent measures in 
place. Strict regulations and 
policy procedures to combat 
misconduct (see, for example, 
go.nature.com/alznzb) 

are needed before Europe’s 
ambitious Horizon 2020 
research-funding programme is 


introduced in 2014 (Nature 478, 
16; 2011). 

At present, the programme 
contains no provision for 
scientific misconduct. Even 
the guidelines of the European 
Code of Conduct for Research 
Integrity make no detailed 
recommendations for dealing 
with fraudulent research (see 
go.nature.com/juubnq). 

Under the Horizon 2020 
proposals, grant applicants 
should sign an ethics declaration 
confirming that they abide 
by the European Code of 
Conduct and will cooperate in 
and support investigation into 
suspected violations and alleged 
misconduct. 

Creation of a European 
agency for research integrity 
would guarantee that any work 
funded by Horizon 2020 meets 
explicit standards. For example, 
misconduct investigations 
could be carried out by agency 
ombudsmen and by ad hoc 
research-integrity committees, 
based on the ethics review 
panels of the current Seventh 
Framework Programme for 
research. The agency would 
listen to whistleblowers and take 
over investigations impeded 
by institutional reluctance or 
by unmanageable conflicts 
of interest. It could also bar 
institutions from receiving 
future grants from the European 
Commission if they failed to 
cooperate. 

Xavier Bosch Department of 
Internal Medicine, Hospital 
Clinic, University of Barcelona, 
Spain. xavbosch@clinic.ub.es 


Caution over Iran’s 
nuclear plans 


Your assertion that the most 
recent International Atomic 
Energy Agency’s report on Iran‘s 
nuclear programme uses “old 
intelligence” to provide “the 
sharpest picture yet of the weapon 
that Iran hopes to develop” 
(Nature 479, 282; 2011) warrants 
some contextual explanation. 
Some analysts say that this 


intelligence dates from the 
turn of the millennium. This is 
because the latest US National 
Intelligence Estimates of Iran’s 
nuclear programme (one issued 
in 2007 and another earlier 
this year), which represent 
a consensus of all 16 US 
intelligence agencies, concluded 
that Iran ended its efforts to 
build a nuclear weapon in 2003 
(see go.nature.com/ms1sq7). 
Many of the same political 
elements that erroneously 
claimed that Iraq had nuclear 
weapons are the ones now 
insisting that Iran is trying to 
build a nuclear bomb. This time 
around, let’s be sure of the facts. 
John W. Grula Carnegie 
Observatories, Pasadena, 
California, USA. 
jgrula@obs.carnegiescience.edu 


Canny liars score by 
tricking themselves 


Robert Trivers makes another 
significant contribution to 
evolutionary and behavioural 
biology in his latest book about 
deceit and self-deception (Nature 
478, 314-315; 2011). 

The idea that self-deception 
is beneficial because it makes it 
easier to deceive others has also 
been noted by other observers 
of human nature. Writer Mark 
Twain declared: “When a person 
cannot deceive himself, the 
chances are against his being 
able to deceive other people,” 
and author Elizabeth Bear more 
recently advised: “The secret 
to getting away with lying is 
believing with all your heart.” 
Mark A. Davis Macalester 
College, Saint Paul, Minnesota, 
USA. davis@macalester.edu 


CORRECTION 

Late editorial changes to 

A. Lescroél and D. Grémillet’s 
Correspondence (Nature 479, 
299; 2011) imply that the Ross 
Sea is already under absolute 
protection against fishing, 
which it is not. 
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OBITUARY 


Norman Ramsey 


(1915-2011) 


Physicist behind precision spectroscopy and atomic clocks. 


physics, Norman Ramsey’s name is 

synonymous with a technique for preci- 
sion spectroscopy, which earned him a share 
of the 1989 Nobel prize. Beyond the lab, he 
was also respected as a scientific statesman, 
admired for his fairness and integrity, his 
energy and unfailing good humour. Over 
his long career he inspired generations 
of physicists, including the 84 graduate 
students whom he supervised. 

Ramsey, who died aged 96 on 4 Novem- 
ber, was born in 1915 in Washington DC, 
the son of an Army officer and a math- 
ematics teacher. He entered Columbia 
University, New York, in 1931, majoring 
in engineering and then shifting to math- 
ematics. On graduation, he studied phys- 
ics at the University of Cambridge, UK, 
working for two years alongside pioneers 
of nuclear physics such as J. J. Thomson, 
Ernest Rutherford and James Chadwick. 

Returning to Columbia in 1937, Ram- 
sey became a graduate student in the 
group of Isidor Rabi. Just a few months 
later, Rabi invented a revolutionary 
method for studying the structure of 
atoms and molecules, using streams of 
irradiated molecules in a vacuum. Ram- 
sey’s research blossomed: he shared in 
the discovery that the deuterium nucleus 
(a proton and neutron) is not spherical, 
implying that nuclear forces do not act 
along the line between the nuclei. 

Shortly after receiving his PhD in 1940, 
Ramsey joined the war effort at the MIT 
Radiation Laboratory in Cambridge, 
Massachusetts, developing radar systems; 
later he worked on the Manhattan Pro- 
ject. Following the war, Ramsey returned 
to Columbia. With Rabi, he helped 
found Brookhaven National Laboratory 
in Upton, New York, serving as the first 
chair of its physics department. In 1947, 
Ramsey moved to Harvard University, where 
he remained for the rest of his career as the 
Higgins Professor of Physics. 

At Harvard, Ramsey undertook a series 
of classic studies on nuclear and magnetic 
interactions within molecules, which 
helped to lay the foundations of the theory 
of ‘chemical shifts’ for nuclear magnetic 
resonance and magnetic resonance imag- 
ing, as well as revealing many properties 
of nuclei, such as their magnetism. He is 
best known for inventing the separated 
oscillatory field method — today known 


E the fields of atomic and molecular 


simply as the Ramsey method — in which 
particles are subjected to two pulses of tem- 
porally separated radiation, improving the 
resolution with which their energy struc- 
ture can be measured. The method is now 
ubiquitous in high-precision and quantum 
physics, and is an essential ingredient in 
many atomic clocks. 


In another breakthrough, Ramsey and 
his student and colleague, Daniel Kleppner, 
found a way to confine hydrogen atoms for 
a relatively long time, typically a second, 
thereby enabling the atoms to radiate into a 
resonant microwave cavity. The result was 
the hydrogen maser, which emits stable 
radiation at the atoms’ resonant frequency (a 
wavelength of 21 centimetres) and today has a 
key role in the Global Positioning System and 
in radio astronomy. The maser was also used. 
in 1980 to demonstrate precisely Albert Ein- 
stein’s prediction that clocks run at different 
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rates in different gravitational potentials. 

Asa scientific statesman, Ramsey served 
as first assistant secretary general for sci- 
ence in NATO in 1958 and 1959, establish- 
ing summer schools and fellowships that 
helped to educate a generation of post- 
war European physicists. He founded the 
Harvard cyclotron, and later chaired the 
Harvard-MIT Cambridge Electron Accel- 
erator. For 16 years he served as president 
of the University Research Association, 
which managed the construction and 
operation of Fermilab in Batavia, Illinois. 

During Senator Joseph McCarthy’s anti- 
communist witch-hunt of the early 1950s, 
Ramsey appeared on television to rebut 
attacks on his colleague Wendell Furry and 
on Harvard University. Although he disa- 
greed with Furry’s political views, he said 
that “true freedom gives a man not only 
the right to make a right choice but also the 
freedom to make a wrong choice”. In the 
1980s, he chaired the National Research 
Council committee that concluded that 
acoustic evidence did not support the idea 
that a second gunman was involved in the 
assassination of President John F. Kennedy. 

When I was one of Ramsey's graduate 
students, he was heavily occupied with 
Fermilab. Yet he seldom, if ever, missed 
the weekly group meetings with students 
at Harvard, always offering his encour- 
agement. Even in his later years, when 
he wound down his research group, he 
continued to lecture and participate in 
conferences, attending almost every 
session and engaging with students and 
senior scientists alike. 

Ramsey’s enthusiasm and energy were 
legendary. At a group outing to a beach 
ona cold and windy day during my grad- 
uate-school days, he was the only one in 
the water — trying out his surfboard. And 
in his eighties he trekked through Nepal 
with his wife, hiking for miles with a severed 
Achilles tendon. 

Norman Ramsey was a towering figure 
in physics. Through his accomplishments, 
community service, warmth and high ethical 
standards, he was an inspiration to many. = 


David Wineland is a physicist in the 
Time and Frequency Division of 

the National Institute of Standards 
and Technology (NIST) in Boulder, 
Colorado 80305, USA. 

e-mail: david.wineland@nist.gov 
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A radical approach to diversity 


Compounds containing the trifluoromethyl group have many uses, but their isomers must often be made using different 
multi-step routes. Two studies now show how several isomers can be made by the same route. SEE LETTER P.224 


ANDREW T. PARSONS & 
STEPHEN L. BUCHWALD 


he incorporation of fluorine atoms into 

organic molecules has had a profound 

impact on the development of new 
products for numerous industries. This is 
largely due to the useful properties that fluo- 
rine imparts to organic molecules, such as 
stability and lipophilicity, and the fact that 
it can modify molecular polarity. Among its 
many applications, fluorine incorporation has 
been used in pharmaceutical development to 
prevent molecules from being metabolized too 
quickly, thereby allowing a drug to act before 
it is cleared from the body’. Fluorine has also 
played an integral part in the development of 
agrochemicals and high-performance materi- 
als’. Two papers, one by Ji et al.’ in Proceedings 
of the National Academy of Sciences and the 
other by Nagib and MacMillan‘ on page 224 
of this issue, now report innovative methods 
for introducing an important fluorinated 
group — the trifluoromethyl group, CF, — 
into molecules. 

Appending fluorinated groups to small 
organic molecules has long posed a consider- 
able challenge to synthetic organic chemists. 
The trifluoromethyl group has been par- 
ticularly difficult to install, in part because 
the reactive intermediates that are generated 
during trifluoromethylation reactions are 
unstable under the conditions necessary for 
the reactions to proceed. The harsh protocols 
typically required for these reactions can limit 
the substrates that can be used and/or cause 
side-product formation. 

Reports highlighting efforts to use milder 
methods that avoid the common problems 
associated with trifluoromethylation chem- 
istry have therefore recently garnered great 
attention’. Although much progress has 
been made, many methods continue to rely 
on the use of ‘pre-functionalized’ reactants 
containing chemical groups that, in conjunc- 
tion with transition-metal catalysts or a pro- 
moter reagent, direct trifluoromethylations 
to particular sites in the reactant (Fig. la). 
Alternatively, structurally complex trifluoro- 
methyl-containing molecules can be made from 
simpler starting materials that already contain 
a trifluoromethyl group (Fig. 1b). But using 
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Figure 1 | Approaches for making trifluoromethylated compounds. a, One traditional approach 

for making compounds appended with trifluoromethyl (CF) groups involves making a precursor of 
each isomer that contains a surrogate group (X), which is then converted into a trifluoromethyl group 
at the end of the synthesis. A different multi-step synthesis is required to make each isomer; only one 
such route is shown. b, The second traditional approach starts from compounds that already contain a 
trifluoromethyl group. Again, a different multi-step synthesis is required to make each isomer, only one 
of which is shown. c, Two papers™’ now report a different approach: a structurally complex molecule is 
made, which is then converted into several isomers of the trifluoromethylated analogue in a single step. 
This strategy reduces the time and labour required to make several trifluoromethylated analogues of 
structurally complex molecules, and minimizes the overall waste produced. 


these methods comes at a price: the preparation 
of pre-functionalized and trifluoromethyl- 
containing starting materials is time-consum- 
ing and costly in terms of chemicals used and 
waste disposal, and overall is inefficient. Fur- 
thermore, when access to a diverse range of 
trifluoromethylated molecules is desired, each 
molecule requires its own pre-functionalized 
starting material to be prepared, amplifying 
the inefficiencies so that they become increas- 
ingly prohibitive. 

Despite their drawbacks, existing chemo- 
selective (site-selective) methods are often 
useful when the best trifluoromethylated 
molecule for a given application is known. 
But what happens when the optimal site for 
trifluoromethylation in a molecule is not 
known, as is often the case for those working in 
fields such as drug discovery and agrochemical 
research? In many instances, it is desirable to 
append trifluoromethyl groups to a molecule 
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at as many positions as possible, to make lots 
of different isomers. This approach maxi- 
mizes the diversity of the molecules obtained 
and increases the probability of discovering a 
molecule that has desirable chemical proper- 
ties and biological activity. 

The reports by Ji et al.’ and Nagib and 
MacMillan‘ highlight their efforts to develop 
versatile methods for the rapid preparation of 
a diverse range of trifluoromethyl-containing 
molecules. Both groups sought a general 
protocol that provides access to trifluoro- 
methylated molecules without the need for 
pre-functionalized substrates. They targeted 
a chemical transformation, termed C-H 
trifluoromethylation, in which the hydrogen 
of a carbon-hydrogen (C-H) bond is replaced 
with a trifluoromethyl group. Although ambi- 
tious, this approach would provide a large 
payoff in efficiency by allowing trifluoro- 
methyl groups to be installed at late stages of 


synthetic routes, without first preparing pre- 
functionalized or trifluoromethyl-containing 
substrates. It would therefore decrease the 
number of synthetic operations necessary to 
access a variety of trifluoromethylated mol- 
ecules (Fig. 1c). 

The authors** recognized that the devel- 
opment of a successful method hinged on 
identifying a practical trifluoromethyl source. 
Both groups independently surmised that 
the reliable generation of highly reactive tri- 
fluoromethy] radicals (“CF;) as intermediates 
might allow the direct trifluoromethylation of 
non-functionalized substrates, in what Ji et al.’ 
call an “innate trifluoromethylation” In other 
words, the inherent reactivity of the substrates 
would dictate the site (or sites) at which the 
trifluoromethylation reaction occurs. 

The two groups pursued different 
approaches to develop their own mild, efficient 
methods to access reactive trifluoromethyl- 
radical intermediates. Ji et al. developed a tran- 
sition-metal-free approach that relies solely on 
air- and moisture-stable reagents (which are 
easier to handle than commonly used orga- 
nometallic reagents and transition-metal 
catalysts that react with air and moisture). 
They determined that the Langlois reagent° 
(NaSO,CF;) serves as a reliable source of “CF, 
when treated with t-butylhydroperoxide (a 
commonly used reagent in radical chemis- 
try). Nagib and MacMillan‘, however, chose 
triflyl chloride (CISO,CF;) as their source of 
‘CF; (refs 7, 8). They generated the radicals 
simply by shining light from a household fluo- 
rescent bulb onto a solution of triflyl chloride 
in the presence of a light-activated ruthenium 
catalyst. Both methods** generated ‘CF; in 
the presence of a diverse range of substrates, 
and provided access to an array of structur- 
ally complex trifluoromethylated molecules. 
Furthermore, both methods worked reliably 
even for substrates bearing potentially sensi- 
tive functional groups (those whose presence 
ina molecule often adversely affects the course 
of a range of chemical reactions, including 
traditional trifluoromethylation reactions). 

To demonstrate the versatility of their meth- 
ods, both groups conducted trifluoromethyl- 
ations of common pharmaceuticals and 
natural products. Ji et al.’ successfully achieved 
C-H trifluoromethylations of the natural- 
product derivative dihydroquinine and the 
billion-dollar drug Chantix (varenicline, used 
to combat nicotine addiction), providing prod- 
ucts bearing a single trifluoromethyl group. 
These examples are significant because access- 
ing CF,-dihydroquinine and CF;—Chantix 
using previously available trifluoromethylation 
methods would require laborious, multi-step 
total syntheses. Notably, the reactions were 
chemoselective, even though the starting 
materials contained multiple reactive sites. 

Similarly, Nagib and MacMillan’ subjected 
the cholesterol-lowering drug Lipitor (ator- 
vastatin) to their reaction conditions. This 


experiment resulted in the non-selective for- 
mation of three chromatographically sepa- 
rable isomers of CF,-Lipitor, in which the 
trifluoromethyl group was appended to dif- 
ferent carbons in the molecule. The improved 
efficiency of this approach for accessing new 
chemical entities cannot be overstated, given 
that preparation of these isomers using tra- 
ditional methods would require three paral- 
lel total syntheses, which would be extremely 
time- and labour-intensive. 

Although the new methods” constitute a 
major contribution to the field of trifluoro- 
methylation chemistry, many challenges 
remain. For example, it would be desirable 
to have precise control over the site at which 
C-H trifluoromethylation occurs. This might 
be accomplished through the development 
of reaction conditions that combine the best 
attributes of the currently available methods. 
Complementary protocols that selectively 
produce different trifluoromethyl-containing 
molecules froma single substrate would also be 
advantageous, because it would avoid the often 
difficult task of separating mixtures of isomers 
generated as products. Ji et al.’ report their ini- 
tial efforts towards addressing this problem, 
and their preliminary results are encouraging. 
This area of research will continue to bea major 
focus in organic chemistry, and future strate- 
gies for trifluoromethylation will certainly be 
influenced by the current reports”. 
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In the shorter term, these breakthroughs 
open the door to the preparation of a diverse 
range of molecules that would previously have 
been difficult to access. The ability of these 
streamlined approaches to perform late-stage 
trifluoromethylations on structurally complex 
molecules will undoubtedly have an impact on 
the work of research chemists in the pharma- 
ceutical, agrochemical and other industries. = 
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The clock within 


The molecular clock machinery regulates organisms’ responses to daily 
variations in the environment. One unexpected response seems to be temporal 
fine-tuning of stem-cell behaviour in the skin. SEE ARTICLE P.209 


LORENA AGUILAR-ARNAL & 
PAOLO SASSONE-CORSI 


of most organisms follow a daily, or circa- 

dian, rhythm. This contributes to adapta- 
tions to environmental variables such as the 
light-dark cycle. Although in mammals the 
coherent ticking of circadian oscillations is 
ultimately orchestrated by a tiny region of the 
brain — the suprachiasmatic nucleus’ — the 
oscillations are intrinsic to almost all tissues 
and cells, including the epidermal layers of the 
skin. Circadian oscillations in gene expression 
are controlled by a number of factors that con- 
stitute the molecular clock. Strikingly, in the 
epidermis this molecular clock is not solely 
dedicated to the regulation of the tissue’s circa- 
dian rhythms. On page 209 of this issue, Janich 
et al.’ report that the clock also controls the 
alternate cycles of dormancy and proliferation 


le general, the physiology and behaviour 


in mouse epidermal stem-cell populations. 
This finding places the molecular clock in a 
strategic position for the fine-tuning of epider- 
mal homeostasis. 

Adult stem cells are present in various 
organs and tissues, where they reside in 
distinct areas within specialized micro- 
environments, or ‘niches. They remain ina 
slow-cycling state until needed for the mainte- 
nance of tissue homeostasis. However, they can 
also become active in response to malignant 
signals, leading to inflammation and cancer. 
The transition of stem cells from dormancy 
to proliferation remains poorly understood. 
One question is why only a few stem cells of 
a population respond to proliferation and 
differentiation signals. Janich and colleagues 
provide unanticipated insight into this aspect 
of stem-cell biology. 

The molecular machinery controlling cir- 
cadian functions involves a complex network 


8 DECEMBER 2011 | VOL 480 | NATURE | 185 


© 2011 Macmillan Publishers Limited. All rights reserved 
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ations of common pharmaceuticals and 
natural products. Ji et al.’ successfully achieved 
C-H trifluoromethylations of the natural- 
product derivative dihydroquinine and the 
billion-dollar drug Chantix (varenicline, used 
to combat nicotine addiction), providing prod- 
ucts bearing a single trifluoromethyl group. 
These examples are significant because access- 
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using previously available trifluoromethylation 
methods would require laborious, multi-step 
total syntheses. Notably, the reactions were 
chemoselective, even though the starting 
materials contained multiple reactive sites. 
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the cholesterol-lowering drug Lipitor (ator- 
vastatin) to their reaction conditions. This 


experiment resulted in the non-selective for- 
mation of three chromatographically sepa- 
rable isomers of CF,-Lipitor, in which the 
trifluoromethyl group was appended to dif- 
ferent carbons in the molecule. The improved 
efficiency of this approach for accessing new 
chemical entities cannot be overstated, given 
that preparation of these isomers using tra- 
ditional methods would require three paral- 
lel total syntheses, which would be extremely 
time- and labour-intensive. 

Although the new methods” constitute a 
major contribution to the field of trifluoro- 
methylation chemistry, many challenges 
remain. For example, it would be desirable 
to have precise control over the site at which 
C-H trifluoromethylation occurs. This might 
be accomplished through the development 
of reaction conditions that combine the best 
attributes of the currently available methods. 
Complementary protocols that selectively 
produce different trifluoromethyl-containing 
molecules froma single substrate would also be 
advantageous, because it would avoid the often 
difficult task of separating mixtures of isomers 
generated as products. Ji et al.’ report their ini- 
tial efforts towards addressing this problem, 
and their preliminary results are encouraging. 
This area of research will continue to bea major 
focus in organic chemistry, and future strate- 
gies for trifluoromethylation will certainly be 
influenced by the current reports”. 
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In the shorter term, these breakthroughs 
open the door to the preparation of a diverse 
range of molecules that would previously have 
been difficult to access. The ability of these 
streamlined approaches to perform late-stage 
trifluoromethylations on structurally complex 
molecules will undoubtedly have an impact on 
the work of research chemists in the pharma- 
ceutical, agrochemical and other industries. = 
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of most organisms follow a daily, or circa- 

dian, rhythm. This contributes to adapta- 
tions to environmental variables such as the 
light-dark cycle. Although in mammals the 
coherent ticking of circadian oscillations is 
ultimately orchestrated by a tiny region of the 
brain — the suprachiasmatic nucleus’ — the 
oscillations are intrinsic to almost all tissues 
and cells, including the epidermal layers of the 
skin. Circadian oscillations in gene expression 
are controlled by a number of factors that con- 
stitute the molecular clock. Strikingly, in the 
epidermis this molecular clock is not solely 
dedicated to the regulation of the tissue’s circa- 
dian rhythms. On page 209 of this issue, Janich 
et al.’ report that the clock also controls the 
alternate cycles of dormancy and proliferation 


le general, the physiology and behaviour 


in mouse epidermal stem-cell populations. 
This finding places the molecular clock in a 
strategic position for the fine-tuning of epider- 
mal homeostasis. 

Adult stem cells are present in various 
organs and tissues, where they reside in 
distinct areas within specialized micro- 
environments, or ‘niches. They remain ina 
slow-cycling state until needed for the mainte- 
nance of tissue homeostasis. However, they can 
also become active in response to malignant 
signals, leading to inflammation and cancer. 
The transition of stem cells from dormancy 
to proliferation remains poorly understood. 
One question is why only a few stem cells of 
a population respond to proliferation and 
differentiation signals. Janich and colleagues 
provide unanticipated insight into this aspect 
of stem-cell biology. 

The molecular machinery controlling cir- 
cadian functions involves a complex network 
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of auto-regulatory feedback loops that 
affect gene expression. The positive 
components of this machinery — the 
transcriptional activators CLOCK and 


BMALI — drive rhythmic expression A 

of the clock-controlled genes. Among Clock OFF y alee 

these are the genes that encode the Epidermal 

negative components of the molecular Cinder stem cells 

clock, the repressor proteins crypto- oscillations Bei 
SKIN Cells 


chrome (CRY1 and 2) and period 
(PER1-3)*. Additional factors, such 
as other clock-related proteins, post- 
translational modifications and small 
molecules that can feed back into 
the clock, add further complexity to 
the system, thereby ensuring regu- 
lated expression of at least 10% of the 
cellular genes in a circadian manner’. 

Janich et al.’ studied a mouse model 
engineered to express a fluorescent 
protein in a way that reflects circadian 
clock oscillations. The authors find 
that, in the hair-follicle bulge (one of 
the two locations where epidermal 
stem cells reside), two subpopulations 
of stem cells coexist. These differ in the 
way that they express the core compo- 
nents of the clock machinery (Fig. 1). 
This, in turn, results in differential 
expression of several genes known to control 
stem-cell dormancy, proliferation and differ- 
entiation. Among these are genes that encode 
TGE- factors, which inhibit proliferation and 
differentiation, and Wnt-signalling factors, 
which promote growth. Consequently, at any 
one time, one subpopulation is more prone to 
proliferation and differentiation. 

What could be the advantage of two stem-cell 
subpopulations coexisting in the same niche? 
Stem-cell heterogeneity may have evolved to 
allow the cells both to self renew, thus replen- 
ishing their reserve in the niche, and to keep a 
ready-to-go population that can respond to the 
signals that trigger differentiation. Whether the 
two epidermal stem-cell subpopulations repre- 
sent two lineages with distinct developmental 
origins, and whether they can interconvert, 
should be determined (Fig. 1). 

In subsequent experiments, Janich and 
co-workers disrupted circadian rhythm in 
mice by selectively deleting the Bmal1 gene in 
the basal keratinocyte cells of the animals’ skin. 
The hair-follicle bulges of these mice contained 
fewer proliferative cells and a higher number of 
dormant stem cells than those of normal mice. 
Moreover, Bmall-deficient bulge stem cells 
were more responsive to TGF-6 signalling and 
exhibited decreased expression of Wnt-related 
genes while maintaining higher expression lev- 
els of TGF-B-related genes. When the authors 
deleted the clock inhibitors Per1 and Per2, they 
observed the opposite effect — increased pro- 
liferation of bulge stem cells. Thus, the positive 
and negative components of the clock machin- 
ery seem to function as alternative rheostats of 
epidermal stem-cell proliferation. 


Clock ON.” 
“ LTGF-B, 
T Wnt 


Proliferation, 
differentiation 


Figure 1 | Stem cells and circadian oscillations. Epidermal stem 
cells coexist as two subpopulations: one that remains dormant and 
the other that becomes active to maintain epidermal homeostasis. 
Janich et al.’ report that, in response to circadian oscillations, the 
two subpopulations differentially express components of the clock 
machinery. This, in turn, affects expression of proteins such as Wnt 
signalling factors, which promote proliferation, and TGF-6 factors, 
which inhibit growth. Other factors such as metabolites and 
hormones could also be involved in regulating epidermal stem-cell 
heterogeneity. The ‘decision-making’ processes probably involve 
metabolic and epigenetic control. Whether activated stem cells can 
become dormant again remains unknown. 


Previous studies have also suggested a role 
for the circadian clock in stem-cell biology. For 
example, the release of haematopoietic stem 
cells — which develop into blood cells — from 
the bone marrow into the bloodstream follows 
acyclical pattern that seems to be regulated by 
the circadian clock through the sympathetic 
nervous system’, 

Adult stem-cell differentiation, which is 
essential for tissue homeostasis, has also been 
linked to circadian oscillations. Clock genes, 
for instance, mediate inhibition of bone for- 
mation from osteoblast cells, and several 
mouse models deficient in clock genes exhibit 
increased bone mass relative to controls’. 
Moreover, embryonic fibroblasts derived from 
mice lacking either Bmall or Per2 show altered 
ability to differentiate into adipocytes”, indi- 
cating that the clock machinery affects forma- 
tion of the fat tissue. Janich et al. also find that 
disruption of Bmal1 in the skin of mice induces 
epidermal ageing and reduces tumorigenesis. 
These pieces of evidence clearly point to a 
role for the clock in maintaining skin-tissue 
homeostasis. 

Janich and co-authors’ findings” consolidate 
the existing idea of crosstalk between circadian 
rhythms and differentiation. They also raise 
several questions. The undifferentiated state 
of the cell has been correlated with a lack of 
circadian physiology’. It is therefore important 
to determine which stimuli initiate the clock’s 
ticking in the niche. It could be that, whereas 
differentiating stimuli can mediate this process 
in embryonic stem cells’, signals originating 
from the central nervous system activate adult 
stem cells. Other unknowns include how 
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and when communication between 
the central pacemaker in the supra- 
chiasmatic nucleus and the peripheral 
clocks it synchronizes is established, 
and how relevant it is to the differen- 
tiation process. 

Anemerging theme is that the met- 
abolic state of cells is crucial for clock 
regulation and that the clock machin- 
ery contributes to the maintenance of 
the metabolic state. Indeed, the find- 
ing that the metabolic mediator NAD* 
and its biosynthetic pathway are com- 
ponents of the circadian oscillatory 
network””” is evidence for modula- 
tion of the clock machinery through 
metabolic pathways. Because stem- 
cell metabolism is reprogrammed in 
response to differentiation signals”, 
the possibility that cyclical metabolism 
may contribute to epidermal stem-cell 
physiology is intriguing. Deciphering 
the relationship between circadian 
oscillations and stem-cell metabolism 
could lead to new targets to manipu- 
late the differentiation and function 
of adult stem cells for regenerative 
medicine. 

The information conveyed by differ- 
entiation signals is integrated into stem cells 
epigenetically — through chemical modifi- 
cations of the genome that do not affect the 
DNA sequence but are inherited after cell 
division — to establish the cells’ fate. Does the 
clock machinery contribute to this process? 
Accumulating evidence indicates that intri- 
cate chromatin (DNA-protein complexes) 
remodelling events that are involved in the 
establishment of several epigenetic marks in 
the genome follow a circadian pattern, and 
ultimately control circadian gene expression”. 
Establishment of such a clock-regulated epi- 
genome during differentiation raises the 
attractive possibility that differentiating cells 
use the clock machinery to direct the epige- 
netic marks that define certain gene-transcrip- 
tion patterns in the new cell type. The existence 
of these fascinating links should become clear, 
as epigenetic features of epidermal stem cells 
are already being explored’*™*, 

This paper’ raises yet another intriguing 
question: is this mechanism of clock-mediated 
epidermal stem-cell control conserved in other 
adult stem cells? Expect exciting insight into 
stem-cell biology in the light of Janich and 
co-workers’ data. m 


Lorena Aguilar-Arnal and Paolo Sassone- 
Corsi are at the Center for Epigenetics and 
Metabolism, School of Medicine, University of 
California, Irvine, Irvine, California 92697- 
4625, USA. 

e-mail: psc@uci.edu 

1. Schibler, U. & Sassone-Corsi, P. Cel/ 111, 919-922 


(2002). 
2. Janich, P et al. Nature 480, 209-214 (2011). 


3. Zhang, E. E. & Kay, S. A. Nature Rev. Mol. Cell Biol. 
11, 764-776 (2010). 

4. Méndez-Ferrer, S., Lucas, D., Battista, M. & Frenette, 
P.S. Nature 452, 442-447 (2008). 

5. Fu,L, Patel, M. S., Bradley, A., Wagner, E. F. & 
Karsenty, G. Cel! 122, 803-815 (2005). 

6. Shimba, S. et al. Proc. Natl Acad. Sci. USA 102, 
12071-12076 (2005). 

7. Grimaldi, B. et al. Cell Metab. 12, 509-520 (2010). 

8. Yagita, K. et a/. Proc. Nat! Acad. Sci. USA 107, 


ASTROPHYSICS 


3846-3851 (2010). 
9. Nakahata, Y., Sahar, S., Astarita, G., Kaluzova, M. & 
Sassone-Corsi, P. Science 324, 654-657 (2009). 
10.Ramsey, K. M. et al. Science 324, 651-654 (2009). 
11.Nakada, D., Levi, B. P. & Morrison, S. J. Neuron 70, 
703-718 (2011). 
12.Masri, S. & Sassone-Corsi, P. Nature Neurosci. 13, 
324-1329 (2010). 
13.Luis, N. M. et al. Cell Stem Cell 9, 233-246 (2011). 
14.Mejetta, S. et a/. EMBO J. 30, 3635-3646 (2011). 


— 


Monster black holes 


A combination of ground-based and spacecraft observations has uncovered two 
black holes of 10 billion solar masses in the nearby Universe. The finding sheds 
light on how these cosmic monsters co- evolve with galaxies. SEE LETTER P.215 


MICHELE CAPPELLARI 


iant black holes, with masses of a few 

billion times that of the Sun, have 

fascinated scientists, science-fiction 
writers and the general public alike since they 
were first proposed’ four decades ago. These 
supermassive black holes, which lurk in the 
centre of galaxies, are not just theoretical 
curiosities. Their existence was convincingly 
demonstrated 15 years ago, and soon astro- 
physicists realized that such black holes can 
have profound effects on how galaxies form’. 
This realization spurred a flurry of studies aim- 
ing to understand the joint evolution of galax- 
ies and black holes. On page 215 of this issue, 
McConnell et al.’ present the detection of the 
two most massive black holes ever found. They 
argue that their finding provides a key missing 
piece of evidence to our understanding of how 
galaxies and black holes form. 

The existence of monster black holes was 
originally invoked to explain the intense 
energy released by active galaxies known 
as quasars. These galaxies are especially 
numerous at very large distances from Earth, 
corresponding to a time when the Universe 
was less than half its current age. Their tre- 
mendous brightness is thought to be the last 
flash from gas accelerated to extreme speed 
before being accreted by a giant black hole in 
the galaxy’s nucleus. There is now less gas in 
the nearby Universe than there was at early 
epochs, because most of it was used to make 
stars. For this reason, no quasar exists in our 
cosmic backyard. However, if supermassive 
black holes are responsible for quasars’ power 
production, they should still lie dormant in the 
centre of the most massive nearby galaxies. 

Indeed, systematic searches for supermas- 
sive black holes have found them at the centre 
of all massive galaxies for which reliable 
determinations of the black-hole mass exist 
(see McConnell et al.’ for an up-to-date list). 
But such black holes are not massive enough 


to power the brightest quasars. The ideal can- 
didate galaxies for hosting these largest black 
holes are the massive elliptical galaxies found 
at the centre of galaxy clusters. Like giant spi- 
ders at the centre of a web, these galaxies lie at 
the bottom ofa cluster’s gravitational potential 
well, feeding their black hole by accreting gas 
and stars from neighbouring galaxies. How- 
ever, the nearest big galaxy cluster to Earth is 
about 100 megaparsecs away, and this large 
distance makes determining the mass of their 
putative black holes a challenge. 

McConnell et al.’ targeted the central ellipti- 
cal galaxy of two massive clusters, using two of 
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the world’s largest telescopes and observations 
from the Hubble Space Telescope. They found 
two black holes, each with a mass of more than 
10 billion solar masses, in the nuclei of the two 
galaxies. These objects probably represent the 
missing dormant relics of the giant black holes 
that powered the brightest quasars in the early 
Universe. 

The mass of the supermassive black holes 
studied so far is closely related to the amount 
of random motion — the velocity dispersion 
— of the stars in the central parts of the host 
galaxies*”. The velocity dispersion is related to 
the mass of the spheroidal component, the stel- 
lar bulge, of the galaxies. The existence of this 
empirical relationship is attributed to the fact 
that the black hole grows by feeding from the 
gas that is accreted by its host galaxy. This gas 
accretion leads to the simultaneous growth of 
both the black hole and the galaxy stellar bulge, 
until the intense energy produced by the black 
hole heats up or blows away the gas, halting 
both star formation and black-hole accretion® 
(Fig. 1a). 

Interestingly, the two newly measured super- 
massive black holes are more massive than 
would be predicted from their velocity disper- 
sion. This suggests that, unlike their smaller 
counterparts, these black holes did not grow 
most of their mass by gas accretion but instead 
grew by the ‘dry’ merging of gas-poor galax- 
ies. In a dry merger of two similar galaxies, the 
resulting galaxy’s stellar and black-hole masses 
are the sums of the corresponding masses of 
the two progenitor galaxies (Fig. 1b). However, 


Jet Stellar bulge 


Black hole 


™ 2» 
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Figure 1 | Two ways to grow galaxies and their central black holes. a, In the gas-accretion hypothesis, 

a pure-disk, gas-rich spiral galaxy, possibly with a small black hole, accretes cold gas either from the 
environment (1) or by merging with another spiral galaxy (not shown). The gas sinks to the galaxy’s 
centre, forming a black hole anda stellar bulge. This process terminates when the galaxy becomes ‘active’ 
and produces two opposing high-speed jets of particles (2). The end product is a lenticular galaxy, which 
has a supermassive black hole, disk and stellar bulge, and larger stellar velocity dispersion than the initial 
galaxy (3). b, Alternatively, the galaxy and black hole can grow by ‘dry merging: a gas-poor lenticular galaxy 
accretes a similar galaxy (1). The two galaxies merge and form stellar shells, while their black holes spiral 
towards the centre until they coalesce (2). The result is a spheroidal galaxy that has a larger size and bigger 
black hole than the progenitors, but nearly unchanged velocity dispersion (3). McConnell and colleagues’ 
analysis’ indicates that the second route may dominate the growth of the most massive black holes. 
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contrary to the gas-accretion case, the velocity 
dispersion of the final galaxy’s stars remains 
unchanged, or changes little if the mass dou- 
bling is achieved through a sequence of several 
smaller mergers of gas-poor galaxies’. Either 
way, the merging process leaves behind a black 
hole that is more massive than would be pre- 
dicted from the mass-velocity dispersion 
relationship — just like the two black holes 
observed by McConnell and colleagues’. 

Not everybody agrees that the observed 
mass-velocity dispersion relationship indi- 
cates that black holes regulate galaxy growth 
by stopping gas accretion with their energy 
production (Fig. 1a). Studies*” have shown that 
starting from a set of galaxies containing black 
holes of random masses, a sequence of mergers 
can lead to a relationship close to the observed 


Noise rules 


one without the need for energy input from the 
black hole. However, this simulated relation- 
ship deviates systematically from the observed 
one. These claims*” can be tested by accurately 
measuring black-hole masses in galaxies with 
different properties, as McConnell et al. did. 
Progress requires the largest telescopes and 
the use of state-of-the-art technologies such as 
integral-field spectroscopy, to map the motion 
of the stars in two dimensions, and adaptive 
optics, to correct the blurring effect of Earth's 
atmosphere and achieve sharp images’’. The 
future looks bright for black-hole studies 
using the next generation of 40-metre tele- 
scopes, such as the European Extremely Large 
Telescope, which will significantly increase 
the number of galaxies that can be reliably 
investigated. m 


The idea that gene variants alone control an organism’s traits is overly simple. 
A study of the effects of gene interactions on the outcomes of random variation 
in gene expression reveals the complex reality. SEE LETTER P.250 


HANA EL-SAMAD & JONATHAN S. WEISSMAN 


ver since Gregor Mendel and Thomas 
Bicen crystallized the idea of the gene 

as the heritable unit, biologists have 
been trying to map the relationship between 
genotype (the complement of gene variants 
present in an individual) and phenotype (an 
individual’s physical characteristics). On 
page 250 of this issue, Lehner and colleagues’ 
provide insight into the complexity of this 
relationship by showing how random variation 
in the expression of members of pairs of inter- 
acting genes can act synergistically to affect 
the development of a complex organism — the 
roundworm Caenorhabditis elegans. 

Notable advances in DNA sequencing dur- 
ing the past few years have made it possible to 
define the genotype of any individual rapidly 
and cheaply. As a result, scientists glibly talk 
about the possibility of pinpointing individual 
genes that influence every aspect of our being, 
from our propensity to get cancer to our 
chances of living to be 100 years old. However, 
this appealing vision oversimplifies a much 
more complex reality. There is no one-to-one 
relationship between genotype and pheno- 
type — even identical twins can have radically 
different personalities, disease susceptibilities 
and life trajectories. 

Although many factors contribute to the 
mapping of genotype to phenotype, the influ- 
ence of two of these is especially challenging to 
account for. The first is ‘noise’ — the molecular 
fluctuations inherent in biological systems 


that cause random switching of genes on and 
off. The second is that genes do not act alone, 
but work together to form functional cellular 
networks. That results in complex genetic 
interactions in which the phenotypic impact 
ofa particular variant (or allele) of a gene is 
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Figure 1 | Noise regulons and gene expression. 
Noise regulons are synchronized stochastic 
changes in functionally related genes, and are 
coordinated by an upstream signal. In this 
example, fluctuations (red arrow) in the expression 
of the transcription factor Hsf1 cause the 
coordinated expression of stress-response proteins, 
including the molecular chaperones Hsp4 and 
Hsp90. These proteins then cooperate to buffer 

the effects of a mutant allele. Such a mechanism 
helps explain why loss of tbx-9 in C. elegans causes 
abnormal development in only half of the worms 
affected by this mutation, a phenomenon studied 
by Lehner and colleagues’. 
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dependent on an individual’s full genetic 
make-up. For example, it is not uncommon 
for individual alleles to be beneficial on their 
own, but lethal when two copies are present. 

Alone, either of these factors could gener- 
ate an elaborate relationship between genotype 
and phenotype, but there is inevitably a com- 
plex interplay between the two. Lehner and 
colleagues’ study' examines this interplay. Pre- 
vious work’ has investigated how noise in gene 
expression influences the effects of a defective 
allele of the C. elegans skn-1 gene. Notably, the 
skn-1 mutant allele leads to defects in intestinal 
development in some, but not all, individuals; 
such incomplete development ofa phenotypic 
trait is known as partial penetrance. 

Using a clever approach (known as fluores- 
cence in situ hybridization) that enables the 
exact number of RNA molecules in a cell to 
be counted, this study” established that ran- 
dom fluctuations in certain factors lead toa 
variable ‘all-or-none’ expression of the tran- 
scription factor gene elf-2, which is dependent 
on skn-1. In wild-type worms, the system is 
robustly buffered against such fluctuations. 
But in the skn-1 mutants, the variability is 
unleashed, so that elt-2 is not activated in 
some cells, hence preventing normal develop- 
ment of the gut. These results illustrate how 
quantitative fluctuations in upstream com- 
ponents of a signal-transduction system can 
lead to qualitative differences in a downstream 
developmental event. 

Lehner and colleagues’ further explore the 
impact of stochastic fluctuations on the partial 
penetrance ofa mutation, while also examining 
their interplay with the gene network in which 
the mutant allele is involved. In particular, the 
authors address the question of how fluctua- 
tions in factors expressed early in development 
affect subsequent developmental switches. This 
necessitated the development of a new experi- 
mental strategy, because in the previous work” 
the worms were killed to measure messenger 
RNA levels, thereby precluding the observation 
of later developmental events. 


The authors’ used transcriptional report- 
ers — in which a second copy of the gene of 
interest drives the expression of a fluores- 
cent protein — to follow variations in gene 
expression in live worms. A key requirement 
of this approach (verified by the authors) is 
that fluctuations in expression of the reporter 
accurately reflect fluctuations in expression 
of the endogenous genes. This means that the 
reporter is tracking expression noise result- 
ing from differences in cellular environment 
— extrinsic noise’ — rather than the random 
turning on and off of individual promoters. 

Lehner and co-workers first investigated 
why mutations in the T-box transcription 
factor gene tbx-9 that completely abolish the 
gene’s function cause an incompletely pene- 
trant defect in C. elegans larval development: 
roughly half of the animals lacking tbx-9 
develop normally; the other half have muscle 
and epidermal defects. The authors observed 
that overexpression of thx-8 — a gene closely 
related to thx-9 — eliminates the defects caused 
by loss of thx-9. What’s more, loss of thx-9 in 
their mutants caused upregulation of tbx-8. 

More strikingly, the authors observed that 
differences in thx-8 expression were a strong 
predictor of the phenotypic effects of thx-9 loss 
— tbx-9-mutant worms that expressed tbx-8 
at a low level were considerably more likely to 
develop abnormally than were genotypically 
identical individuals that expressed tbx-8 more 
strongly. Thus, stochastic differences in the 
feedback loop that compensates for tbx-9 loss 
by upregulating tbx-8 contribute to the variable 
penetrance of the tbx-9 mutation. Lehner and 
colleagues observed a similar set of effects with 
the transcription factors flh-1 and flh-2, which, 
like thx-8 and tbx-9, are a pair of related genes 
that resulted from ancient gene duplication. 
This type of noisy feedback between related 
genes may therefore be quite common. 

Although these findings representa step for- 
wards, the variability of tbx-8 expression could 
not fully account for the variable penetrance of 
the tbx-9 mutation. What, then, is the source 
of the remaining variability? Molecular chap- 
erones — proteins that assist other proteins in 
folding — can buffer a wide range of cellular 
defects, especially the Hsp90 chaperone’ (also 
known as DAF-21 in C. elegans). Lehner and 
co-workers therefore investigated random 
fluctuations in chaperone levels in individual 
worms as a possible explanation of the miss- 
ing variability. They found that fluctuations in 
expression of daf-21 were a strong predictor 
of the effects of thx-9 loss. Importantly, the 
effects of the differences in tbx-8 and daf-21 
expression were independent of each other but 
synergistic: more than 90% of worms with high 
levels of daf-21 and tbx-8 expression developed 
normally, whereas roughly two-thirds of the 
animals in which expression of both genes was 
low did not. 

Lehner and colleagues found that the 
observed variability in penetrance of thx-9 loss 


in their study can, to a remarkable degree, be 
accounted for by variations in expression of 
tbx-8 and daf-21. This, in turn, raises the ques- 
tion of what underlying mechanisms in the cell 
cause the observed variability and how many 
other genes are affected by these fluctuations. 
In the case of daf-21, the authors show that the 
fluctuations are probably part of coordinated 
changes in the expression of a broad range 
of chaperone genes, including one known as 
hsp-4. Such ‘noise regulons’ — synchronized 
stochastic changes in functionally related 
genes — are particularly suitable for causing 
coordinated effects on cell physiology, because 
the related genes act together to carry out com- 
mon functions’ ( Fig. 1). However, the authors 
did not find evidence that fluctuations in thx-8 
expression correlate with variations in the 
expression of chaperones, and it may be that 
there are no other genes whose noise co-varies 
with that of thx-8. A more tantalizing possibility 
is that variations in the expression of tbx-8 also 
reflect a coherent cellular state in which other 
members of the thx-8/tbx-9 pathway fluctuate 
in unison. This raises the further question of 
how many such noise regulons exist, and what 
controls their activity. 
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More broadly, it is clear from recent 
studies” that stochastic fluctuations can have 
a big impact on the penetrance of gene alleles. 
A better understanding of the structure of 
noise — which genes tend to fluctuate together, 
and how these fluctuations are controlled — 
should provide crucial insight into the nature 
of the genotype-phenotype relationship. m 
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A sense for touch 


Will a sense of touch similar to that of humans ever be developed in robots? 
Results on the physics of friction for fingerprint -like ridges sliding across 
textured surfaces may lead the way to tactile robotic sensors. 


C. MATHEW MATE & ROBERT W. CARPICK 


hen we rub a finger across a surface, 
our sense of touch is remarkably 
adept at distinguishing between 


different textures’. For example, astute clothes 
shoppers can easily feel the difference in tex- 
ture between cotton and lower quality poly- 
ester fabrics, just as experienced cashiers can 
spot counterfeit banknotes from the feel of 
the paper. Writing in Physical Review Letters, 
Wandersman et al.” provide a major advance 
towards understanding the physics behind 
these tactile sensations, showing how a modu- 
lated frictional signal is generated by finger- 
print-like ridges rubbing against the roughness 
of an opposing surface. This improved under- 
standing of the relationship between friction 
and surface textures should have an impact in 
many areas, but particularly in the field of tac- 
tile sensors for robotic design. Incorporating 
sensors capable of mimicking the human sense 
of touch has long been recognized’ as impor- 
tant for improving the ability of robots to grasp 
objects firmly without damaging them. 
Wandersman et al.” present measurements 


and analysis of the tangential or friction force 
generated as a rubbery elastomer block with 
periodic surface ridges slides over a rough 
glass surface. The ridges on the elastomer 
surface, which have periods ranging from 125 
to 760 micrometres, serve as stand-ins for the 
epidermal ridges on human fingers (Fig. 1a). 
They show that, when pressed against the glass 
surface to form contact regions several mil- 
limetres in diameter, the elastomer deforms 
around the rough surface texture in the same 
way as the epidermal ridges would if pressed 
against such a surface. 

With numerous ridges in contact, one 
might expect that any periodic variation in the 
friction force due to these ridges would aver- 
age out; indeed, this can be shown analytically 
if the friction force generated in each micro- 
scopically small area in contact (much smaller 
than the width of a ridge) is strictly propor- 
tional to the local normal or loading force 
pressing the two surfaces into contact over 
this area. This linear relationship between 
the friction and load forces is usually referred 
to as Amontons’ law of friction‘, which states 
that the friction force, F, is proportional to 
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Figure 1 | The friction force. a, When a fingertip is rubbed against a rough surface, the net friction force 
acting on the epidermal ridges increases with the loading force that the finger exerts on the ridges to press 
them into contact with the surface. b, Wandersman et al.’ measure the net friction force that acts on an 
elastomer block as it slides against glass surfaces of differing roughness. The elastomer block has ridges 
similar in structure and elasticity to the epidermal ridges on a finger. Shown here is the instantaneous 


friction force F, normalized to the average friction force F. 


as a function of sliding distance. A is the 


ave? 


spacing of the ridges on the elastomer block and is 218 micrometres. The force has a slight oscillating 
component that has the same period as the separation between the ridges and increases with the degree 
of roughness on the glass. (Part b modified from ref. 2.) 


the loading force, L, or F=L, where pis the 
coefficient of friction. However, the authors 
find that, rather than this friction modulation 
averaging to zero, the net friction acting on the 
elastomer block goes up and down slightly (by 
up to a few per cent) during the sliding experi- 
ments (Fig. 1b), with the same period as the 
ridges. And, surprisingly, the amplitude of this 
modulation actually increases when the net 
loading force increases. 

One of the fundamental advances pro- 
vided by Wandersman et al.’ is their analysis 
showing that, if the local friction coefficient 
depends even slightly on pressure (which 
is equivalent to friction being slightly non- 
linear with load), the modulation in friction 
can increase with loading force. They find 
good agreement between the experiments and 
a model in which friction varies non-linearly 
with load: F= AL’, where y= 0.87 + 0.04 and 
A is a constant. In this model, the roughness 
of the surface against which the fingerprint- 
like ridges are being slid has the important role 
of providing a heterogeneous distribution of 
contact pressure locally along the ridges. As 
the roughness increases, a wider distribution 
of loading pressure occurs, leading to a larger 
modulation in friction as a result of the non- 
linear nature of friction with load. The spatial 
period of the ridges serves to concentrate the 
minute variation of friction caused by these 
texture-induced pressure modulations all 
at one frequency, making it much easier to 
discern this variation from the average net 
friction. 

The sliding of fingerprint-like ridges over 
surfaces is not the only area in which Wan- 
dersman and colleagues’ analysis should 
apply. Because friction forces are rarely strictly 
linear with loading forces (as postulated in 
Amontons law)’, we believe that this analysis 
could provide a valuable way to use friction 
fluctuations to characterize surface rough- 
ness on many types of material pairs sliding 


against each other. The amplitude and the 
load-dependence of the fluctuations reveal 
information on the surface’s topographic 
characteristics at length scales much smaller 
than that of the patterned ridges. As a result, 
we think that one exciting area to which the 
method developed by Wandersman et al.’ 
could be extended is the characterization of 
surface roughness down to the nanometre 
scale or even smaller atomic length scales. 
For example, for many years, atomic-scale 
modulations of friction have been observed 
when the sharp tip of an atomic force micro- 
scope (AFM) slides across the periodic 


arrangement of atoms on a crystalline surface’. 
However, these AFM experiments typically 
require very small loading forces (nano- 
newtons) to maintain a contact area of only 
a few nanometres in diameter in order to see 
the atomic-scale modulation of friction. But, 
perhaps, with suitably designed patterned 
ridges and friction sensors, this ability to sense 
the atomic-level contribution to the friction 
modulation could be extended from the cur- 
rent nanometre-sized contact zones of AFMs 
to millimetre-sized contact zones, allowing 
future robotic fingers to feel the atomic-level 
contribution to surface texture. m 
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Sacrifice for survival 


Cancer cells ignore oxygen availability, opting for less efficient, anaerobic ways of 
generating energy. The wisdom behind this choice seems to be in preventing the 
accumulation of reactive oxygen species, and so oxidative damage. 


NANA-MARIA GRUNING & MARKUS RALSER 


hen oxygen is plentiful, cells con- 

vert glucose to energy through the 

consecutive processes of glycolysis 
and oxidative respiration. However, cancer 
cells exhibit what is known as the Warburg 
effect: even in the presence of oxygen, they 
prefer the much less efficient process of glu- 
cose fermentation for energy production’. 
This seems counter-intuitive because rapid 
cell proliferation, which is required for tumour 
growth, has high energetic demands”. In a 
paper published in Science, Anastasiou and 
colleagues* provide evidence that cancer cells 
undergo this metabolic shift to clear reactive 
oxygen species (ROS) and so prevent oxidative 
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damage. Thus, reconfiguration of the central 
carbon metabolism to counteract oxidative 
stress seems to be a major prerequisite for 
cancer progression. 

Textbooks offer two possible explanations 
for the decline in respiratory activity during 
cancer development. First, with the increase in 
nucleotide and macromolecule biosynthesis, 
there is a shortage of carbon equivalents for 
oxidative respiration. Second, a higher speed 
of glycolysis makes anaerobic metabolism 
more efficient, with more lactate being gener- 
ated from pyruvate, the end product of glyco- 
lysis; this allows cancer cells to feed each other 
by shuffling lactate’’. 

Neither hypothesis fully explains the meta- 
bolic reconfiguration in cancer cells. For one, 


oxidative respiration occurs downstream of 
glycolysis, and so does not compete with gly- 
colysis for carbon equivalents and would not 
interfere with a high glycolytic flux. Moreover, 
unlike respiring cells, which shuffle pyruvate 
from the cytoplasm into the mitochondria — 
the organelles within which oxidative respi- 
ration occurs — cancer cells actively excrete 
the lactate they generate from pyruvate. This 
contradicts the proposal that cancer cells shut 
down respiration to save carbon equivalents 
for biosynthesis. Finally, even some non-can- 
cerous cells that do not make use of lactate 
(including yeast, T cells and induced pluri- 
potent stem cells) undergo a Warburg-like 
metabolic restructuring during rapid growth. 

Anastasiou and colleagues’ results* bring 
the redox balance centre stage to explain this 
metabolic reconfiguration. They show that the 
glycolytic enzyme pyruvate kinase — a main 
regulator of the Warburg effect — facilitates 
tumour growth by preventing accumulation of 
ROS, and so avoiding oxidative damage. 

In all living cells, ROS leak from the chain 
of reactions that constitute oxidative respira- 
tion, or are generated as by-products of both 
fatty-acid metabolism and biosynthetic redox 
reactions. Under normal physiological condi- 
tions this is not a problem, because ROS levels 
are kept low and in equilibrium with reducing 
molecules. In fact, a certain amount of ROS 
is necessary for normal physiology. But if the 
normal redox balance is disrupted, or ROS 
accumulate, oxidation and disturbed bio- 
chemical reactions damage macromolecules, 
ultimately leading to cell death. Therefore, 
cancer cells rely on a complex anti-oxidative 
machinery that can dynamically supply reduc- 
ing equivalents and clear ROS when required’. 

Pyruvate kinase is a regulator of cellular 
anti-oxidative metabolism. Of the four human 
isoforms of this enzyme, PKM2 plays a cru- 
cial part in cancer metabolism. Like other 
metabolic enzymes, PKM2 levels increase in 
tumours’. However, this protein has a unique 
regulatory role in that its decreased catalytic 
activity is associated with tumour progression 
and the development of the Warburg effect*”. 

When pyruvate kinase activity is low — as 
in cancer cells or in respiring yeast — its sub- 
strate, phosphoenol pyruvate, accumulates*”. 
This inhibits the glycolytic enzyme triose 
phosphate isomerase and leads to activation 
of a pathway alternative to glycolysis — the 
pentose phosphate pathway’. Increased activ- 
ity of this pathway protects against ROS in at 
least two ways. First, it provides NADPH, a 
reducing factor that is required for the activity 
of antioxidant enzymes and for the recycling of 
the anti-oxidant peptide glutathione. NADPH 
also compensates for the redox imbalance 
caused by increased nucleotide and fatty-acid 
synthesis*. Second, the pentose phosphate 
pathway regulates gene expression in favour 
of adaptation to oxidative stress”. 

Anastasiou and co-workers’ establish that 
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Figure 1 | Restructuring cellular metabolism. 
Glucose is converted to pyruvate by the 
cytoplasmic process of glycolysis, generating 
energy. When oxygen is present, pyruvate enters 
mitochondria, where it generates more energy 
through the process of oxidative respiration. 
But, in proliferating cells — and under anaerobic 
conditions — pyruvate is converted to lactate. 
In cancer and respiring yeast, reduced activity 
of pyruvate kinase, the enzyme that catalyses the 
final step of glycolysis, mediates redox balance 
by activating the pentose phosphate pathway’. 
Anastasiou et al.’ show that activation of this 
pathway is crucial for cancer cells, and facilitates 
tumour growth by limiting ROS accumulation 
and, therefore, oxidative stress. 


activation of the pentose phosphate pathway 
and its anti-oxidative activity are essential for 
cancer-cell growth (Fig. 1). They report that, 
in lung cancer cells, oxidation of PKM2 on 
the cysteine amino-acid residue 358 (Cys358) 
keeps its activity low. This increases both the 
concentration of glucose-6-phosphate — the 
metabolite that connects glycolysis to the oxi- 
dative, NADP’-reducing branch of the pentose 
phosphate pathway — and flux through the 
pentose phosphate pathway. 

The authors interfered with the pyruvate- 
kinase-triggered activation of the pentose 
phosphate pathway by increasing PKM2 
activity in the presence of oxidants. To do this, 
they mutated the enzyme’s Cys358 to a serine 
residue or used small-molecule activators. This 
treatment had remarkable effects on cancer- 
cell growth. Accumulation of ROS caused 
oxidative damage and slowed the prolifera- 
tion of cancer cells both in tissue culture and 
in tumours grafted into immunocompromised 
mice. 

These data suggest that inducing the 
Warburg effect promotes cancer growth by 
activating the pentose phosphate pathway, 
maintaining the balance of redox equivalents, 
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providing NADPH and activating antioxidant 
defence systems. The findings have notable 
implications for understanding the energetic 
balance during cancer development: block- 
ing pyruvate kinase to redirect the metabolic 
flux is energetically costly under conditions 
of low respiratory activity because it dimin- 
ishes the step that is responsible for the net 
yield of the cellular energy molecule ATP by 
glycolysis. This indicates that maintenance of 
the redox balance is more limiting for tumour 
growth than are energy levels or biosynthetic 
metabolism. 

Could this metabolic reconfiguration be 
exploited for therapeutic purposes? Poten- 
tially, yes. But targeting a fundamental redox- 
balancing process must be cancer-cell specific, 
otherwise it would heavily damage other 
metabolically active cell types, including liver 
cells, immune cells and neurons. Yet, PKM2, 
triose phosphate isomerase, the pentose phos- 
phate pathway and its associated metabolites 
are not cancer-cell specific. Nevertheless, a 
promising strategy might be to induce ROS 
overload in cancer cells, thereby making them 
vulnerable to oxidative damage by neutraliz- 
ing the protective effects of the Warburg effect. 
To develop such strategies it will be essential 
to pursue comprehensive quantitative and 
qualitative investigations to understand all 
the ROS-producing biochemical reactions in 
the cancer cell. = 
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CORRECTION 

In the News & Views article ‘Ageing: 
Generations of longevity’ by Susan E. 
Mango (Nature 479, 302-303; 2011), it 
was stated that transient exposure of rats 
to a high-sugar/low-protein diet leads to 
glucose intolerance. This should have read 
“transient exposure of rats to a high-sugar/ 
high-fat diet leads to glucose intolerance”. 
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Assembly of hybrid photonic architectures 
from nanophotonic constituents 


Oliver Benson! 


The assembly of hybrid nanophotonic devices from different fundamental photonic entities—such as single molecules, 
nanocrystals, semiconductor quantum dots, nanowires and metal nanoparticles—can yield functionalities that exceed 
those of the individual subunits. Combining these photonic elements requires nanometre-scale fabrication precision and 
potentially involves a material diversity that is incompatible with standard nanotechnological processes. Although 
merging these different systems on a single hybrid platform is at present challenging, it promises improved 
performance and novel devices. Particularly rapid progress is seen in the combination of plasmonic-dielectric 
constituents with quantum emitters that can be assembled on demand into fundamental model systems for future 


optical elements. 


by miniaturization. Following the path taken in electronics of 

reducing devices to their ultimately fundamental forms, for 
instance single-electron transistors'*, now magnetic and optical devices 
have also been scaled down, creating the increasingly active research 
fields of spintronics* and nanophotonics’. 

A fundamental nanophotonic device, involving single elementary 
excitations such as photons and spins, offers various advantages over 
a macroscopic object: its state can be controlled, initialized and read out 
with a precision at the fundamental quantum mechanical limit; its 
operational fidelity based on the interconversion of single excitations 
can approach 100%; and its functionality may be based on coherent 
rather than incoherent dynamics, allowing for devices that maintain 
quantum coherence, which is a crucial requirement for applications in 
quantum information processing®. Thus, a fundamental device could 
outperform traditional devices in terms of operation speed, integrability 
and energy efficiency. 

On the fundamental level, a device consists of various coupled and/or 
integrated entities, such as single emitters, optical microresonators, 
optical waveguides and photon-to-electron or photon-to-spin inter- 
faces. If different materials systems are involved (for example a com- 
bination of organic and inorganic units), then the resulting device might 
be appropriately described as ‘hybrid’. However, this term can also be 
used in a broader sense when new functionality is gained from a com- 
bination of different physical effects. Examples are the fusion of bioassays 
and micro- or nanoelectromechanical systems’ and that of microfluidics 
and optics*. A merging of quantum optics and nanomechanics’, photo- 
nics and atomic physics’®, or quantum optics and plasmonics'"’ is likely to 
provide new fundamental insight as well as novel applications such as 
quantum coherent devices’* or quantum-limited sensors'’. For example, 
a quantum computer, by contrast with its classical counterpart, needs to 
maintain coherent superpositions of discrete quantum states, acting as 
so-called quantum bits. A similar challenge occurs in quantum sensors, 
where exploitation of entangled quantum states allows for improved 
precision over classical equivalents. Whereas long storage times of 
quantum states may be possible, for example in atomic systems, fast 
read-out and information transfer using integrated optics, that is, an 
atomic-photonic hybrid system, would be desirable. The quantum 
mechanical decoherence and energy transfer processes in these hybrid 


T he advancement of modern technology is still driven considerably 


systems are, however, not fully understood. This broader class of hybrid 
system is the topic of this Review, and I place a particular emphasis on 
model systems combining metallic and dielectric nanophotonic con- 
stituents by scanning probe manipulation. 

I will begin by highlighting some key nanophotonic functionalities 
and then briefly introduce several bottom-up approaches for assembling 
hybrid nanophotonic systems from their fundamental components. As 
scanning probe techniques at present offer the greatest versatility, the 
successful application of such techniques for the assembly of hybrid 
nanophotonic architectures will be covered in most detail. Advanced 
top-down approaches based on layered composites", organic—inorganic 
synthesis’, molecular recognition” or soft-matter assembly’” will not be 
reviewed. 


Functionality on the nanoscale 

The intrinsic functionality of a nanophotonic device relies on it provid- 
ing a significant interaction between light and matter. The most fun- 
damental system consists of a single quantum emitter interacting with a 
single mode of the electromagnetic field, for example inside a micro- 
cavity’®. Strong and efficient confinement of the electromagnetic field 
greatly enhances the field strengths and, thus, the strength of the inter- 
action. There are basically two ways to confine the light and enhance its 
field strength: resonant enhancement in dielectric structures where cavity 
quantum electrodynamics’ (CQED) effects can be exploited (Box 1), or 
plasmonic enhancement using metals” (Box 2). In a hybrid device, we 
might even engineer a combination of the two. Examples of desired or 
desirable functionalities are as follows. 


Light guiding and sorting 

A key requirement for a nanophotonic platform is for its different sub- 
units, for example light sources, modulators, filters or detectors, to be 
connected by optical waveguides. This is a well-developed technique in 
standard light-wave communication networks and integrated optics. 
However, a nanophotonic circuit requires tighter confinement and 
low-intensity operation down to the single-photon level”. For dielectric 
systems, a particularly appropriate structure for nanophotonic devices is 
a photonic crystal”, in which a periodic modulation of the index of 
refraction of a dielectric can guide light tightly and control its dispersion. 
Figure 1b shows such a device (discussed further below): a photonic 


1Nano-Optik, Humboldt-Universitat zu Berlin, Newtonstrasse 15, 12489 Berlin, Germany. 
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BOX | 
Cavity QED 


Atypical configuration in which to study CQED?? is shown in the figure. 
A single quantum emitter, for example an atom, an ion, a single 
molecule or a quantum dot, is placed inside a microcavity. Two 
regimes of the time evolution of the system are distinguished. 

In the strong-coupling regime, the dominant process is the coherent 
interaction between the emitter and a single resonant mode of the 
microcavity characterized by the coupling constant g. Energy is 
exchanged between the emitter and the cavity mode in an oscillatory 
way at the Rabi frequency, 2g. The energy loss rate through either 
spontaneous emission from the excited emitter to a continuum of 
other modes (described by the rate y) and the photon loss rate through 
the cavity mirrors (described by the rate ic) is much smaller than the 
energy exchange rate () and x are both much less than g). The emitter 
and the field lose their separate identities and have to be described as 
one coupled system with an own energy-level structure’. The 
coupling constant g is proportional to 1/Vvv, where V is the 
quantization (mode) volume. On the other hand, « is inversely 
proportional to the cavity’s quality factor Q. The ratio Q/V/V should 
therefore be maximized to reach the strong-coupling regime. 

In the weak-coupling regime, the dynamics are incoherent. The 
coupling constant g is small relative to x and y. An excited emitter loses 
its energy in an exponential decay, but with a decay rate strongly 
modified relative to an emitter in free space. This is the Purcell effect?4. 
The decay is enhanced when the emitter’s transition frequency is in 
resonance with the cavity mode frequency, w = c/A(c, speed of light; 2, 
wavelength), or it may be strongly suppressed. For exact resonance 
and perfect alignment of the emitter’s dipole moment with respect to 
the cavity field in a medium with refractive index n, the enhancement, 
or Purcell factor, is P = (3/4n2)(A/ny°O/V. 

CQED effects were first studied in atomic physics experiments”°, 
mostly using Fabry-Pérot cavities, but are now widely explored in 
solid-state systems as well?®. 
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Box 1 Figure | Basics of CQED. a, A typical CQED system consisting of 
a single quantum emitter (two-level system) coupled coherently, with 
coupling constant g, to one mode of an optical microcavity. Incoherent 
processes are spontaneous emission from the emitter to the continuum at 
rate y, and photon loss through the cavity mirrors at rate x. b, Time 
evolution of the excited-state probability and spectrum for the weak- 
coupling (red) and strong-coupling (blue) regimes. 


crystal waveguide guiding light within a thin dielectric film. Frequency- 
selective elements required, for example, for add-drop elements have 
also been realized in photonic crystal structures”. 

Whereas the confinement of light in a dielectric structure is ultimately 
constrained by the diffraction limit, plasmonic waveguides (Box 2) allow 
concentration of combined electronic and optical excitations (plasmon 
polaritons) to length scales far below the optical wavelength. Wave 
guiding is possible along plasmonic wires or along ridges and grooves 
in thin metal films™*. A drawback is damping due to losses in the metal, 
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BOX 2 
Plasmonic enhancement 


Surface plasmon polaritons (SPPs) are electromagnetic excitations 
propagating at the interface between a dielectric and a conductor?**. 
They arise from the coupling of electromagnetic fields to oscillations of 
he conductor’s electron plasma. SPPs can be efficiently excited and 
guided along thin metal strips or wires, called SPP waveguides. Long- 
range propagation over distances of a few millimetres is possible at 
elecommunication wavelengths’. By comparison with their purely 
optical counterparts, the narrow spatial confinement to fractions of a 
ree-space optical wavelength makes them ideal building blocks of 
uture chip-based nanophotonic components and devices, although 
osses in the metal are a challenge. Passive optical elements such as 
bends, beam splitters and interferometers have been demonstrated*4. 

Subwavelength metal particles support localized SPP resonances. 
With gold or silver, the resonances are in the visible spectral range and 
have been used for centuries to stain glass. Current applications 
concern biomarkers or sensors. In addition to a lightning rod effect, 
that is, the field enhancement near sharp geometrical features, 
resonances further increase the local electromagnetic field near the 
metallic surface. Thus, SPPs couple strongly to small particles placed 
in the near field. One feature is that fluorescence and scattering 
processes are enhanced by orders of magnitude. A prominent 
example with a pronounced impact on single-molecule detection is 
surface enhanced Raman scattering: in experiments on this, 
enhancement factors of 10/4 have been observed”. Another feature is 
an increase in the efficiency of nonlinear optical effects for higher- 
harmonic generation’? or for low-intensity photonic switches on the 
single-emitter level*?. 

Modern nanofabrication technology makes it possible to fabricate 
plasmonic structures from the top down. More complex structures 
such as optical nanoantennae*? and metallic photonic crystal 
structures™, which further improve the performance of plasmonic 
nanophotonic elements*, have also been realized. 


Dielectric 


Zz 


Box 2 Figure | Basics of surface plasmon polaritons. a, An SPP asa 
collective excitation at a metal—dielectric interface”. The electromagnetic 
field (electric field, E, plotted in the z-x plane; magnetic field, Hy, sketched 
inthe y direction) _ is drastically enhanced. b, The perpendicular field E, 
decays exponentially with a characteristic length 6, (of the order of the 
optical wavelength) in the dielectric and a characteristic length of 6,, (the 
skin depth) in the metal. 


but strategies to combine plasmonic waveguides with gain material in 
hybrid systems to compensate for losses are being explored’. The wave- 
length-selective transmission of plasmonic nanostructures can also be 
used for efficient light or photon sorting”. 


Enhanced emission and absorption 

In the weak-coupling regime of CQED (Box 1), the Purcell effect can be 
used to funnel single or a few photons emitted from single emitters such 
as quantum dots, molecules or colour centres into specific cavity modes 
with enhanced rate and efficiencies of up to 98% (ref. 27). Efficient 
coupling to free space or to optical fibres is mandatory for improved 
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single-photon sources needed in secure quantum cryptography” or 
quantum intensity standards”. More-complex nanophotonic elements 
for these applications also implement funnelling of charge carriers for 
efficient electrical excitation of single quantum dots (Fig. la). The 
Purcell effect also significantly enhances the transition rate and, thus, 
the optical response of a device, which is relevant for fast optical modu- 
lation or interconnects. 

It has been proposed by Chang et al.*° that plasmonic effects will also 
lead to a strong enhancement of the interaction of single quantum emitters 
with localized excitations in plasmonic waveguides. A similar effect is 
exploited when enhancing the optical response of a plasmonic particle 
to the binding event of a large biomolecule, a technique that is used in 
plasmonic biosensors*'. Also, the impedance mismatch between freely 
propagating light and confined light can be addressed using plasmonic 
structures. Plasmonic nanoantennas™ can drastically enhance the emis- 
sion or absorption rate, whereas periodic metallic nanostructures establish 
highly directional emission* or provide unusual linear properties, such as 
a negative index of refraction, unavailable in natural materials**. These 
features are of paramount importance for optical outputs or optical inter- 
connects in future nanophotonic systems. 


Nonlinear elements and switches 

In the strong-coupling regime of CQED, spontaneous emission is negligible 
relative to the coherent and reversible energy exchange. This exchange is a 
perfect way to transfer electronic quantum states to photons. In terms of 
quantum information processing, this establishes an interface between 
stationary (electronic states) and moving (photons) quantum bits. Also, 
non-resonant interaction can be used to realize conditional phase shifts of 
photon states’’. This is the main functionality required to build an optical 
quantum computer*’, which in principle could solve problems that no 
classical computer could’. 

The energy-level structure of a strongly coupled emitter-cavity system 
(the ‘dressed states’) is an array of manifolds of increasing energy, each of 
which corresponds to a discrete excitation of the system. A narrow-band, 
weak probe beam used to address a strongly coupled system can be on- or 
off-resonant with respect to the energy difference between two manifolds. 
In the first case it will be transmitted, but in the second case it will be 
reflected. As the energy of the manifold depends on the number of 
photons present in the cavity, the presence or absence of even a single 
photon can induce transmission (photon-induced tunnelling) or 
reflection (photon blockade) of the beam*®. This peculiar effect allows 
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the generation of transmitted light with a non-classical photon statistics”, 
and, eventually, the realization of single-photon sources or transistors”®. 

Waks and Vuckovic** have shown that a strong nonlinearity can be 
produced in a cavity-emitter system even if the system is not in the 
strong-coupling regime. In fact, a single emitter placed inside a cavity 
add-drop filter can switch on or off the transmission of a coupled 
waveguide mode as a result of interference effects. This ‘dipole-induced 
transparency’ may be used to build a quantum repeater”, a device that 
helps to transfer quantum entanglement between two partners. Such a 
capability would establish fundamentally secure information exchange 
over arbitrarily large distances. Figure 1b shows a possible experimental 
configuration for dipole-induced transparency realized using a quantum 
dot and a photonic crystal structure*’. 

Ina similar way, a single emitter on a plasmonic waveguide can act as 
a mirror that can be saturated by only a single excitation. Such an 
element could be the key ingredient for a single-photon transistor’! as 
sketched in Fig. 1d. It could provide the missing nonlinear element for 
photonic quantum gates, which have so far relied on interference and 
single-photon detection, for example in integrated photonic circuits”! 
and telecommunication fibre systems”. 

A scaling up of fundamental devices will involve two or more 
elementary entities, for example single emitters, optical resonators or 
non-resonant optical nanostructures. Configurations may be arranged 
to act as optical switches, where the state of a single emitter changes the 
emission frequency of a nearby emitter by means of dipole-dipole inter- 
action** (Fig. 1c). 


Approaches to nanoassembling hybrid devices 


To exploit fundamental interactions in nanophotonic—plasmonic 
hybrid structures, advanced nanoassembly techniques are required. 
Various strategies to fabricate fundamental hybrid devices out of 
different materials and subunits have been demonstrated. Figure 2 
summarizes different approaches schematically. 


Random assembly 

The simplest idea is a random approach by which, for example, different 
nanoparticles are spin-coated or drop-cast on a substrate (Fig. 2a). By 
chance, the desired configuration may be found within the ensemble. 
This approach was used by, for example, Kinkhabwala et al.“*, who spin- 
coated a solution with molecules on a sample with ‘bow-tie’ plasmonic 
antennas. Sometimes they found a single molecule well within the 


Figure 1 | Nanophotonic functional elements. 
a, Advanced, electrically driven single-photon 
source: a single semiconductor quantum dot 
embedded in a light-emitting-diode structure 
including a distributed Bragg reflector** (DBR). 

b, Scanning electron microscopy (SEM) image of 
an integrated photonic structure to test dipole- 
induced transparency. The structure includes a 
grating coupler (left; also shown magnified) and a 
quantum dot inside a photonic crystal resonator 
coupled to a photonic crystal waveguide” (also 
shown magnified). c, Proposed logic gate based on 
three near-field-coupled semiconductor quantum 
dots (QDs) with appropriately aligned energy 
levels*’. Depending on the state of two near-field- 
coupled quantum dots (QD A and QD B, 
representing two bits) that can be controlled by 
optical input pulses, resonant energy transfer to a 
third quantum dot (QD C, serving as output) is 
possible. d, Proposal for a single-photon 
transistor*’. A light pulse from a dielectric 
waveguide is transferred into a plasmon 
propagating along a tapered metal nanowire. In the 
tapered region, interaction with even a single 
emitter is so large that its quantum state state can 
control the transmission of the pulse. 


Tapered nanowire 
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antenna gap. Such an approach is very inefficient and cannot be scaled 
up to complex configurations involving three or more constituents”. 


Self-assembly 

If there are interactions between the entities (submicrometre particles, 
large molecules and the like) that are strong enough to overcome thermal 
diffusion, then self-ordering may lead to deposition of the particles in 
ordered patterns (Fig. 2b). Van der Waals”, electrostatic'* or magnetic*® 
interaction may be used to establish self-ordering. For example Min et 
al.° describe the formation of cobalt nanoparticle rings and cadmium 
nanorod superlattices under the influence of magnetic and electrostatic 
fields, respectively. The advantage of this technique is the control of 
patterns through modifications of the interactions and the possible 
fabrication of many configurations in parallel. 


Directed self-assembly 

Control of spatially ordered structures can be improved by growth or 
assembly on preprocessed or locally functionalized substrates (Fig. 2c). 
Etched defects or metal nanoparticles can act as seeds for self-assembled 
growth of single semiconductor quantum dots” or nanowires**. Mohan 
et al.”” used electron beam lithography and etching to fabricate a regular 
pattern on a GaAs substrate. In this way, pyramidal quantum dots could 
be grown in a regular array. Also, patterning with chemical*® markers 
allows deposition of specific molecules or nanoparticles on predefined 
sites. Such self-assembly approaches provide spatially well-separated 
structures that can be addressed individually. Apart from physical 
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Figure 2 | Different approaches to assembling nanophotonic elements. 

a, Random approach: various particles are deposited randomly. By chance, two 
particles (here a yellow one anda green one) are close enough to form a coupled 
pair. b, Self-assembly: interparticle interactions, for example electrostatic fields 
(red lines), mediate a periodic configuration. c, Directed self-assembly (seeded 
growth): nanowires (green) are grown predominately on the prepatterned 
defects on a substrate. d, Directed self-assembly (field-guided assembly): 
negatively charged particles (red spheres) agglomerate along a positively 
charged wire pattern (green stripes of width W). e, Select and post-process: the 
position of a quantum dot is measured with respect to markers and 
subsequently a photonic crystal structure (periodic holes) are fabricated. 

f, Scanning probe assembly: particles deposited using a dip pen are precisely 
positioned with the help of an AFM tip. 
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defects, local electric, magnetic or optical fields can also create favourable 
spots for the deposition of particles down to the single-atom level 
(Fig. 2d). 


Select and post-process 

This approach relies on the high precision of modern nanofabrication 
used, for example, for today’s metal-oxide semiconductor technology. 
The approach that I here call ‘select and post-process’ starts with the 
fabrication of samples with a random distribution of nano-objects, for 
example optically active quantum emitters such as quantum dots or nano- 
wires’'. Subsequently, one object of interest is selected and its relative 
position on the sample is measured with high precision. Frequently, the 
substrate is patterned with an array of markers to facilitate this procedure. 
In a subsequent post-process step, electric contacts, microresonators or 
other elements are fabricated around or next to the selected object by 
optical or electron beam lithography (Fig. 2e). Using this technique, a 
fundamental nanophotonic system consisting of a single semiconductor 
quantum dot strongly coupled to an optical microresonator (a resonant 
defect mode in a photonic crystal structure) was realized”’. As the select 
and post-process approach starts with a randomly distributed ensemble, 
it cannot be scaled up to assemble structures with more than one emitter. 
A possible solution, however, is post-processing of structures with site- 
controlled emitters””. 


Scanning probes for manual assembly 

By comparison with the previously described approach, scanning probes 
offer the ultimate limit of nanomanipulation, allowing individual con- 
stituents to be assembled. First demonstrations with scanning tunnelling 
microscopes’ (STMs) and atomic force microscopes** (AFMs) showed 
impressive results in regards to the positioning of single atoms* on 
surfaces. 

STM and AFM scanning probes have developed into versatile tools for 
nanomanipulation and nanoassembly and are particularly useful in 
assembling hybrid structures. Not only large molecules*®, but also metallic 
nanoparticles” or nanorods”*, semiconductor nanoparticles”, carbon 
nanotubes” and plastic particles*', can be manipulated on a large variety 
of substrates. Modifications of ‘dip-pen’ nanolithography® make it 
possible to pick up and release nanoparticles in a controlled way*’ 
(Fig. 2f). Also, optical or scanning electron microscopes can be equipped 
with triple-axis micromanipulators for simultaneous imaging and 
manipulation. 


Nanophotonic- plasmonic hybrid devices 


Among the various approaches to assembling hybrid systems summarized 
above, scanning probe techniques provide the widest versatility, and so 
will be the focus of this section. Using such techniques, different con- 
figurations can be assembled with the same constituents, allowing a 
detailed investigation of the constituents’ interactions. Furthermore, once 
assembled, the configurations can be encapsulated to be used as stable 
nanophotonic units in various applications, such as fibre-coupled single- 
photon sources™ for quantum information processing or quantum light 
standards. 

Whereas nanoassembly with scanning probes has been widely used in 
developing electronic”, molecular** and biomolecular°® devices, the 
assembly of nanophotonic elements has reached this point only recently 
and is still in its early stages. Progress has been hindered by the lack of 
photostable nanoscopic emitters that allow easy manipulation. At the 
level of very few or even single emitters under ambient conditions, 
protection against photobleaching by overgrowth or embedding in films 
is often required, preventing any additional positioning of the emitters 
on the substrate. Optically stable emitters appropriate for scanning 
probe assembly are metallic (plasmonic) nanoparticles”, semiconductor 
nanowires” or colour centres in diamond nanocrystals”. Colour 
centres have particular potential as they represent single quantum emit- 
ters even at room temperature® (the most widely studied centre is the 
nitrogen-vacancy centre®°). 
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To illustrate the power of scanning probes to assemble nanophotonic 
hybrid structures in a controlled way, I highlight two examples in more 
detail. 


Plasmonically enhanced single-photon sources 

Figure 3 shows a manually assembled, hybrid, single-photon source oper- 
ating at room temperature. To construct it, appropriate constituents 
(here metal nanoparticles and diamond nanocrystals) were first isolated 
and precharacterized. This was done with an AFM (in tapping mode, 
where only modest interaction with the particles occurs), to map the topo- 
logy of a larger area, and with subsequent optical investigation, that is, 
measurements of fluorescence or photon correlations. This investigation 
provides information about the number of emitters in the particle”. This 
first step is similar to the select and post-process approach described above. 
However, in a second step active manipulation with an AFM tip (now in 
contact mode to exert significant force on the particles) was performed 
and complex configurations were fabricated in a step-by-step procedure 
(Fig. 3). This is sufficient for a well-defined coupling to plasmons, which 
requires an accuracy of some tens of an optical wavelength. 

In this example, a single nanodiamond containing a single nitrogen— 
vacancy defect centre was coupled to a single gold particle or sandwiched 
between two gold particles forming a plasmonic nanoantenna. Plasmonic 
enhancement (see Box 2 and previous paragraph) leading to an increase 
in photon yield (number of photons emitted per unit of excitation 
power) of two orders of magnitude was achieved at room temperature. 
Even higher values should be possible** in a more advanced antenna 
structure*’, which should also allow directional emission. This excellent 
coupling of single emitters to plasmonic wires may be used in improved 
single-photon sources*’ to produce large nonlinearities for the realization 
of single-photon transistors*' or to launch single plasmonic excitations 
into quantum plasmonic circuits. 


Nanowire photonic elements 

In another example (Fig. 4), various dielectric and plasmonic nanowires 
were manually assembled to form coupled lasers and waveguides®. The 
end facets of a small semiconductor nanowire can act as optical mirrors 
forming a Fabry-Pérot-type optical resonator. Optical excitation of the 
wire’s photoluminescence can bring the system above the lasing threshold. 
In Fig. 4a, two such lasers, respectively made from GaN and ZnO wires, 
were coupled to a SnO) nanoribbon”'. A triple-axis manipulator with a 
tungsten probe tip was used to arrange the configuration manually. The 
SnO,; nanoribbon acts as an efficient single-mode waveguide to collect and 
guide the laser light both from the GaN wire and the ZnO wire. 
Remarkably, the coupling efficiency of laser light into the waveguide 
approaches 50%, and light can be guided with a loss of ~10dBcm'. 
Optimum coupling is achieved when wire and ribbon are in the 
‘staggered-bonded configuration’, that is, in physical contact over dis- 
tances of a few micrometres. Simultaneous guiding of light from two 
(or more) nanolasers establishes the possibility of performing all- 
nanowire nonlinear wave mixing within single nanocavities. Moreover, 
the ability to transfer coherent optical pulses over hundreds of micro- 
metres could be applied to shuttle packets of electro-optical information 
in future computing and communications devices. 
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Figure 3 | Stepwise AFM nanoassembly of a plasmonic nanoantenna. a-c, A 
single nitrogen-vacancy defect centre in a diamond nanocrystal (a) is 
sandwiched between two gold nanoparticles (60 nm in diameter) (b, c). The 
sketches (bottom) illustrate the configuration mapped by false-colour AFM 


imaging (top). 
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Figure 4 | Nanoassembled photonic-plasmonic wire structures. a, Dark- 
field image of two nanowire lasers (GaN and ZnO, each ~130 nm in diameter) 
coupled to a SnO, nanoribbon waveguide’ (with a rectangular cross-section of 
240 nm X 260 nm). b, Dark-field (left) and optical (right) images of a hybrid 
optical-plasmonic routing device consisting of a silver nanowire bridging two 
SnO, nanoribbon waveguides”. Excitation light is incident from the bottom of 
the image (indicated by the arrow labelled k,). 


In Fig. 4b, it is shown how a silver plasmonic nanowire can be used to 
link two SnO, nanoribbon waveguides”. A transfer of light can be seen 
in the optical image. The system represents a hybrid optical-plasmonic 
routing device that combines the subwavelength confinement of light in 
a plasmonic structure with low-loss guiding in dielectric waveguides. 
Efficient integration and connection between plasmonic structures on a 
nanophotonic circuit can be realized in this way, which is a requirement 
for the single-photon transistor concept depicted in Fig. 1d. Connection 
of these structures to the ‘outside world’ with tapered optical fibres has 
also been demonstrated”. 

It must be noted that in the example in each of the two previous 
subsections, a hybrid approach is mandatory. In the first example there 
is no well-established fabrication technology for the material (advanced 
photonic structures in diamond), whereas in the second example the 
required properties of the material (low damping of the silver nanowire) 
can be established only by chemical synthesis, which is incompatible 
with lithographic fabrication. Nanomanipulation by means of micro- 
manipulators’”* or advanced scanning probe techniques” comple- 
ments the hybrid approach in an ideal way. 


Future prospects 


So far, several simple nanophotonic hybrid devices have been assembled 
with scanning probes or micromanipulators. Figure 5 shows some 
examples additional to those discussed in the previous section. Scaling 
up to produce more-complex nanophotonic elements in which a small 
but arbitrary number of emitters are controllably coupled to dielectric or 
plasmonic nanostructures is in principle straightforward. 

A critical outstanding problem is to connect nanophotonic hybrid 
systems to macroscopic components. Tapered optical fibres” or plasmonic 
antenna structures” have proved to be excellent couplers to propagating 
light in optical waveguides and free beams, respectively, with efficiencies 
exceeding 90%. A strategy to relax the need for free-beam coupling is to 
incorporate sources and detectors on the nanophotonic chip itself. 
Examples of integrated emitters were provided in Figs 3-5 and on-chip 
detection of single surface plasmons was demonstrated recently”’. 

It has to be mentioned that manual assembly with scanning probes has 
its own flaws. Emitters in solid-state matrices show an inhomogeneous 
broadening, with the result that in nanophotonic elements exploiting 
resonances, individual tuning of the emission frequency of each emitter 
is required. However, techniques for such individual tuning have been 
developed’’. Also, emitters embedded in very small carrier particles, for 
example defect centres in diamond nanocrystals, often show spectral 
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Figure 5 | Different systems assembled using scanning probes. a, SEM 
image of four nanodiamonds arranged in a linear chain”. b, SEM image of 
single nanodiamond, containing nitrogen-vacancy centres, placed on the rim 
of a SiO> microdisk resonator”'. c, SEM image of a nanodiamond at the end of a 
nanomanipulator positioned on the end facet near the core region of an optical 
fibre”’. With this set-up, fluorescence from the nitrogen—vacancy centre is 
coupled directly into the optical fibre. d, Dark-field image SnO, nanoribbon 
grid’'. Inset, scanning electron image of the junction at the lower right vertex. 
e, False-colour AFM image of two gold nanoparticles assembled to form a 
plasmonic nanoantenna on a photonic crystal heterostructure cavity etched 
into a thin Siz3N4 membrane” 


diffusion, blinking and linewidth broadening’*. Furthermore, assembling 
a nanophotonic structure one constituent at a time with a precision of 
only a few nanometres is instructive for fundamental research and in 
proof-of-principle experiments, but it is not acceptable as a method of 
production of devices on a larger scale. However, the successful operation 
of the first nanophotonic hybrid model systems has motivated efforts to 
modify or improve established fabrication methods. For example, elec- 
tron beam and optical lithography have been used to fabricate photonic 
crystal structures in materials in which this has not previously been 
possible (for example diamond” and GaP*®), to develop plasmonic struc- 
tures with reduced damping” and to realize integrated optical circuits for 
optical quantum technology. Another approach is to combine different 
techniques with the goal of surpassing their individual limitations. One 
example of this is ion implantation through a perforated atomic force 
cantilever, which simultaneously provides nanometre spatial resolution 
and single-dopant operation’. 

The first key functionalities have been successfully demonstrated in 
hybrid photonic architectures assembled from fundamental nanophotonic 
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constituents. Single photons can be generated, routed, detected and inter- 
converted into other fundamental excitations such as single surface 
plasmons. Also, nonlinear interactions facilitating logical operations 
are feasible using CQED or plasmonic effects. There is great potential 
to establish additional functionality in a hybrid approach by combining 
optical, magnetic and electrical interactions. First steps towards 
nanomagnetometry™, surface plasmon lasers**, organic plasmon- 
emitting diodes*® and single-molecule or single-photon transistors*” have 
been taken. 


Most of the systems realized so far represent model systems, but a 


remarkable level of complexity has already been reached. These manually 
assembled photonic architectures are an exciting test area in which to 
develop ideas for new functionalities in future nanophotonic hybrid 
elements. 
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New gene functions in megakaryopoiesis 
and platelet formation 


A full list of authors and their affiliations appears at the end of paper. 


Platelets are the second most abundant cell type in blood and are essential for maintaining haemostasis. Their count and 
volume are tightly controlled within narrow physiological ranges, but there is only limited understanding of the 
molecular processes controlling both traits. Here we carried out a high-powered meta-analysis of genome-wide 
association studies (GWAS) in up to 66,867 individuals of European ancestry, followed by extensive biological and 
functional assessment. We identified 68 genomic loci reliably associated with platelet count and volume mapping to 
established and putative novel regulators of megakaryopoiesis and platelet formation. These genes show 
megakaryocyte-specific gene expression patterns and extensive network connectivity. Using gene silencing in Danio 
rerio and Drosophila melanogaster, we identified 11 of the genes as novel regulators of blood cell formation. Taken 
together, our findings advance understanding of novel gene functions controlling fate-determining events during 
megakaryopoiesis and platelet formation, providing a new example of successful translation of GWAS to function. 


To discover novel genetic determinants of megakaryopoiesis and 
platelet formation, we performed meta-analyses of GWAS for mean 
platelet volume (MPV) and platelet count (PLT). Our analyses 
included 18,600 (13 studies, MPV) and 48,666 (23 studies, PLT) indi- 
viduals of European descent, respectively, and up to ~2.5 million 
genotyped or imputed single nucleotide polymorphisms (SNPs)'. 
Briefly, we tested within each study (Supplementary Table 1) the 
associations of MPV and PLT with each SNP using an additive model; 
we then combined these study-specific test statistics in a fixed-effects 
meta-analysis. To reduce the risk of spurious associations, we applied 
common stringent quality control filters and the genomic control 
method? to the meta-analysis, which shows no evidence for residual 
inflation of summary statistics (Supplementary Fig. 1). 

A total of 52 genomic loci reaching statistical significance at the 
genome-wide adjusted threshold of P= 5 X 10° were discovered in 
this stage 1 analysis; 55 additional loci reached suggestive association 
(5X10 °<P<5X10 °). We tested one SNP per locus in a stage 2 
analysis that included in silico and de novo replication data in up to 
18,838 individuals from 12 additional studies, confirming 15 addi- 
tional loci (Supplementary Table 2). One further independent locus 
(TRIM58) associated with PLT was identified through detection of 
secondary association signals. Overall, 68 independent genomic 
regions were associated with PLT and MPV with P=5 x 10 °, of 
which 52 are new and 16 were described previously in Europeans*° 
(Table 1). Of the 68 loci, 43 and 25 loci were associated significantly 
with PLT and MPV, respectively; 16 of them reached genome-wide 
significance with both traits (Supplementary Fig. 2). This partial over- 
lap reflects the negative correlation of both traits (gender-adjusted 
r= —0.49, Fig. 1a) that results from the tight control of platelet mass 
(PLT X MPV)’. The association of some loci with both PLT and MPV 
may reflect this negative correlation between the two traits or inde- 
pendent pleiotropic effects of a locus on megakaryopoiesis and platelet 
formation. The different statistical power at the two traits and small 
effect sizes at many loci reduce our power to discriminate among loci 
controlling MPV and PLT through analysis of platelet mass. Their 
testing will require the collection and analysis of PLT and MPV in 
large independent homogeneous cohorts. Some loci, however, have a 
clear-cut effect. For instance, BAK1 affects PLT specifically, compatible 
with its role in apoptosis and platelet lifespan. 


We further tested the association of the 68 loci in 7,949 (MPV) and 
8,295 (PLT) samples of south Asian and 14,697 (PLT) samples of 
Japanese’ origin. We detected substantial overlap of association sig- 
nals, with effect size and direction highly concordant with findings in 
Europeans (Supplementary Fig. 3 and Supplementary Table 3). In the 
south Asian sample, 15 of the 68 (22.1%) loci were significant after 
adjustment for multiple testing (P=7 X 10“). In the Japanese 
sample, 13 of 55 (23.6%) PLT loci showed significance. Moreover, 
73 of 84 (87%, South Asians) and 45 of 55 (82%, Japanese) SNPs 
showed associations with effect estimates directionally consistent 
with Europeans. Such concordance is highly unlikely to be due to 
chance (P=2.3X10 |? and P=2.1X 10 °), and provides inde- 
pendent validation of the locus discovery in Europeans. 

The 68 loci cumulatively explain 4.8% of the phenotypic variance in 
PLT and 9.9% in MPV, accounting respectively for average increases of 
2.57 X 10°1_' PLT and 0.10 fl MPV per copy of allele. These levels of 
explained variance are in accordance with other GWAS of complex 
quantitative traits’. Our results indicate that many other common 
variants of similar or lower effect size, rare variants as well as structural 
variants may also contribute to the variation of both platelet traits. We 
used the method of ref. 10 to estimate the number of additional PLT- 
and MPV-associated loci having effect sizes comparable to those 
observed in our analysis. The method (with caveats discussed in the 
Supplementary Information) predicted that 137 and 81 such loci exist 
for PLT and MPV respectively, accounting for 9.7% and 18.3% of the 
total phenotypic variance. 


Gene-prioritization strategies 

Evidence from recent, highly powered meta-analyses suggests that the 
association peaks are enriched for genes controlling key underlying 
biological pathways'’’. In our case, a large proportion of the asso- 
ciation signals (46 out of 68) had the most significant SNP in stage 1 
(‘sentinel SNP’) mapping to within a gene-coding region, including 
several key regulators of haemostasis (ITGA2B, F2R, GP1BA), mega- 
karyopoiesis (THPO, MEF2C) and platelet lifespan (BAK1). Through 
an unbiased analysis of our GWAS results, we estimated that PLT- 
associated SNPs are significantly more likely to map to gene regions 
than expected by chance (P < 0.05, Supplementary Fig. 4), suggesting 
that we may prioritize the search of additional yet unknown genes 
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Table 1 | Summary of loci associated with platelet count and mean platelet volume in Europeans 
GWAS © Trait Sentinel SNP Chr Position Cytoband Locus Effect/other n Effect (s.e.)t P valuet Het. Rep.{ Refs# 
locus (build 36) (build 36) allele* P value 
1 MPV rs17396340 1 10,208,762 1p36.22 KIF1B A/G 21,612 0.008(0.002) 2.83 x 10-8 0.83 - - 
2 PLT —_rs2336384 i 11,968,649 1p36.22 MFN2 G/T 57,366 2.172 (0.382) 1.25 x10°8 0.31 - - 
3 MPV rs10914144 i 170,216,372 1q24.3 DNM3 C/T 18,589 0.014(0.001) 1.11 x 10°24 0.46 Yes 6 
PLT rs10914144 1 170,216,372 T/C 54978 3.417(0.487) 2.22 x 10-12 0.79 Yes 
4 MPV rs1172130 1 203,511,575 1q32.1 TMCC2 G/A 21,141 0.011(0.001) 3.82 x 10-2” 0.17 Yes 6 
PLT  _rs1668871 1 203,503,759 C/T 58,108 2.804(0.368) 2.59x10 14 0.45 - 
5 PLT rs7550918 1 245,742,181 1q44 LOC148824 T/C 54,171 3.133(0.471) 291x101 0.85 - - 
6 PLT _rs3811444 1 246,106,073 1q44 TRIM588 C/T 27,955 3.346(0.574) 5.60*x 10°° 0.66 - 

7 PLT _rs1260326 2 27,584,443 2p23 GCKR T/C 54396 2.334(0.381) 9.12 x 10-12 0.11 Yes - 
8 MPV rs649729 2 31,317,888 2p23.1 EHD3 T/A 20,850 0.008(0.001) 1.17 x10~!4 0.61 Yes 6 
PET. rs625132 2 31,335,803 G/A 45,217 4.236(0.568) 915x104 0.98 - 

9 PLT rs17030845 2 43,541,382 2p21 THADA C/T 65,738 3.577(0.556) 1.27 x 10° 1° 0.40 - - 
10 MPV rs4305276 2 241,143,685 2q37.3 ANKMY1 G/C 20,618 0.008(0.001) 1.71 x1071!3 0.71 - - 
11 PLT — rs7616006 3 12,242,647 3p25 SYN2 A/G 58,564 1.997(0.366) 4.86 x 107% 0.20 - - 
12 PLT rs7641175 3 18,286,415 3p23 SATB1 A/G 58,366 2.757(0416) 337x107 0.34 - - 
13. MPV rs1354034 3 56,824,788 3p14.3 ARHGEF3 T/C 18,286 0.023(0.001) 3.31 x 10-© 0.00 Yes 4,6 
PLT _rs1354034 3 56,824,788 C/T 49,135 6848(0.442) 2.86x10 % 0.50 Yes 
14 PLT — rs3792366 3 124,322,565 3q21.1 PDIA5 G/A 58,335 2.153(0.365) 3.60 x 10° 0.07 - - 
15 MPV rs10512627 3 125,822,911 3q21.1 KALRN C/G 21,108 0.006(0.001) 5.10 x 19-7° 0.41 - - 
16 MPV rs11734132 4 6,942,419 4p16.1 KIAA0232 G/C 17,444 0.011(0.002) 1.11x10~13 0.20 - - 
17 PLT — rs7694379 4 88,405,532 4q22.1 HSD17B13 A/G 56,430 2.129 (0.37) 8.70 x 10° 0.44 Yes - 
18 MPV rs2227831 5 76,059,249 5q13.3 F2R G/A 21,654 0.021 (0.003) 9.65 x 10 %6 0.11 - - 
PLT rs17568628 5 76,082,694 T/C 44,759 6.074(0.993) 9.61 x 10-1 0.77 - 

19 PLT rs700585 5 88,187,872 5q143 MEF2C C/T 55,469 2.703 (0.442) 9.86 x 10° 1° 0.06 - - 
MPV rs4521516 5 88,135,706 G/C 28,157 0.008 (0.001) 1.89 x 10°° 0.39 - 

20 PLT — _rs2070729 5 131,847,819 5q31.1 IRF1 A/C 56469 2394(0.371) 1.13x 1O=1° 0.73 - - 
21 MPV rs10076782 5 158,537,540 5q33.3 RNF145 A/G 18,025 0.007(0.001) 448 x 108 0.52 - - 
22 PLT. rs441460 6 25,656,266 6p22.2 LRRC16 G/A 58,064 3.08 (0.359) 8.70 x 10 18 0.61 - - 
23 PLT — rs3819299 6 31,430,345 6p21.33 HLA-B G/T 48,687 5.048 (0.824) 880 x 10=1° 0.90 - - 
24 PET rs399604 6 33,082,991 6p21.32 HLA-DOA C/T 57,674 2.346(0.365) 1.30 x 10°1° 0.23 - - 
25 PLT rs210134 6 33,648,186 6p21.31 BAK1 G/A 58,554 4.957(0.396) 7.11 x10-%° 0.67 Yes 68 
26 PLT — rs9399137 6 135,460,710 6q23.3. HBS1L-MYB C/T 57,857 5.901 (0.41) 5.04 x 10-47 0.74 Yes 8 
27 MPV rs342293 7 106,159,454 7q22.3 FLJ3603 1- G/C 20,193 0.017(0.001) 7.03 x10-°” 0.19 Yes 5,6 
PLT 1$342275 7 106,146,451 PIK3CG C/T 58,571 3.742(0.363) 5.57 x10~25 0.17 - 

28 PLT — rs4731120 7 123,198,458 7q31.3 WASL C/A 66,147 4.14 (0.592) 2.77 x 10-14 0.46 - - 
29 PLT —_rs6993770 8 106,650,703 8q23.1 ZFPM2 A/T 54,960 3.668(0.437) 430x10-?” 0.14 - - 
30 PLT —_rs6995402 8 145,077,548 8q24.3 PLEC1 C/T 57,593 2.304(0.371) 509x 10-2° 0.10 - - 
31 MPV rs10813766 9 321,489 9p24.3 DOCK8 T/G 21,104 0.007 (0.001) 3.68 x 10°14 0.45 - - 
32 PLT rs409801 9 4,734,742 9p24.1 AK3 C/T 56,063 5.585 (0.378) 2.59 x 10-49 0.47 - 6 
33 PLT rs13300663 9 4,804,947 9p24.1 RCL1 C/G 48,092 5.585 (0.483) 9.83 x 10° 8° 0.64 Yes 8 
34 PLT = rs3731211 9 21,976,846 9p21.3 CDKN2A A/T 54,529 3.281(0.438) 643x104 0.86 Yes - 
35 PLT rs11789898 9 135,915,483 9q34.2 BRD3 T/G 57,391 3.014(0.476) 2.39 x 101° 0.70 - - 

36 MPV rs7075195 0 64,720,664 10q21.2 JMJD1C A/G 21,226 0.014(0.001) 239x10 0.89 Yes 
PLT rs10761731 0 64,697,615 T/A 54344 3.849(0.378) 2.02 x 10 *4 0.56 Yes 

37 PLT rs505404 1 233,267 11p15.5 PSMD13- G/T 54,642 4.662(0.453) 7.44x10~25 0.86 - 6 
MPV rs17655730 1 260,714 NLRP6 T/C 20,875 0.01 (0.001) 2.29 x 10°18 0.29 - 

38 PLT  rs4246215 1 61,320,874 11q12.2 FEN1 T/G 56,299 2.451 (0.39) 3.31 x 101° 0.41 Yes - 

39 PLT rs4938642 1 118,605,115 11q23.3 CBL C/G 56,605 4.73 (0.727) 7.66 x 10713 0.98 Yes - 

40 MPV rs1558324 2 6,159,479 2013.31 CD9-VWF A/G 20,387 0.01 (0.001) 1.55 x 10°21 0.48 Yes - 
PLT — rs7342306 2 6,161,353 G/A 55,636 2.532(0.384) 429x101 0.14 - 

Al MPV rs2015599 2 29,326,746 2p11.22 MLSTD1 A/G 21,102 0.008(0.001) 5.55 x 10° %6 0.76 - - 

42 MPV rsl10876550 2 52,998,574 2q13.13 COPZ1- G/A 21,214 0.008(0.001) 1.86x10 14 0.38 Yes - 

NFE2- CBX5 

43, MPV rs2950390 2 55,341,557 12q13.3 PTGES3- C/T 21,238 0.008(0.001) 7.45x10 14 0.75 - - 
PLT rs941207 2 55,309,550 BAZ2A G/C 55,653 2.751(0.431) 1.74~x 101° 0.33 - 

44 PLT _rs3184504 2 110,368,990 2q24.12 SH2B3 T/C 56,354 3.99 (0.374) 1.22 x 10°26 0.07 - 68 

45 PLT  rs17824620 2 111,585,376 2q24.13 RPH3A- C/A 51,530 2.457 (0.428) 9.67 x 10°° 0.26 - - 

PTPN11 

46 MPV rs7961894 2 120,849,965 2q24.31 WDR66 T/C 29,755 0.03 (0.001) 142x101 0.48 Yes 4,6 
PLT _rs7961894 2 120,849,965 C/T 51,897 3.923 (0.609) 1.22 x 10; !° 0.05 Yes 

47 PLT rs4148441 3 94,696,207 13q32 ABCC4 G/A 64,120 4.117 (0.6) 6.76 x 10 !% 0.38 Yes - 

48 MPV rs7317038 3 113,060,898 13q34 GRTP1 C/T 27,646 0.006(0.001) 827 x 107! 0.09 - - 

49 PLT — rs8022206 4 67,590,658 14q24.1 RADS1L1 G/A 52,251 3.197 (0.5) 1.55 x10°?° 0.59 - - 

50 PLT — rs8006385 4 92,570,778 14q31 ITPK1 G/A 64,929 3.587(0.558) 1.24 x 10°-?° 0.28 - - 

51 PLT rs7149242 4 100,229,168 14q32.2 C14orf70- G/T 61,247 2.142(0.385) 2.68 x 108 0.07 - - 

DLK1 

52 PLT rs11628318 4 102,109,839 14q32.31 RCOR1 A/T 62,438 2.572(0.405) 2.04 x 10°10 0.84 - 

53 PLT — rs2297067 4 102,636,537 14q32.32 C14orf73 T/C 41,687 3.538(0.553) 1.58 x 101° 0.79 - - 
MPV rs944002 4 102,642,567 A/G 22,910 0.008(0.001) 4.76x10~73 0.66 Yes 

54 MPV rs3000073 4 104,800,836 14q32.33 BRF1 G/A 21,229 0.007(0.001) 3.27x10 1 0.28 - - 

55 PLT — rs3809566 5 61,120,776 15q22.2 TPM1 G/A 57,113 2.443 (0.39) 3.65 x 10° 1° 0.33 - 6 

56 PLT rs1719271 5 62,970,853 15q22.31 ANKDDIA G/A 56,782 3.414(0.502) 105x107 0.00 - - 

57 PLT rs6065 a 4,777,160 17pter-p12 GP1BA T/C 64,987 4.191 (0.63) 2.92 x 10711 0.00 Yes 8 

58 PLT 1s397969 7 19,744,838 17p11.1 AKAP10 C/T 60,944 2.131(0.357) 232 x 10°° 0.92 - - 

59 MPV rs8076739 a 24,738,712 I7gli2 TAOK1 T/C 21,652 0.013(0.001) 4.59 x 10-38 0.12 - 4,6 
PLT 13559972 7 24,838,621 T/C 53,460 3.264(0.375) 3.30 x 10°18 0.25 
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Table 1| Continued 
GWAS © Trait Sentinel SNP Chr Position Cytoband Locus Effect/other n Effect (s.e.)+ P valuet Het. Rep. Refs# 
locus (build 36) (build 36) allele* P value 
60 PLT rs10512472 17 30,908,916 17q12 SNORD7- C/T 58,692 3.636(0.477) 240x101* 0.08 Yes - 
PV rs16971217 17 30,968,167 AP2B1 C/G 21,089 0.009(0.001) 3.77x10 1% 0.01 
61 PLT rs708382 17 39,797,869 17q21.31 FAM171A2- T/C 50,036 2.439(0.431) 1.51x108 0.46 - - 
ITGA2B 
62 PLT  rs11082304 18 18,974,970 18q11.2 CABLES1 G/T 58,215 2.48(0378) 527x101! 0.73 Yes - 
63 PV rs12969657 18 65,687,475 18q22.2 CD226 T/C 19,285 0.007(0.001) 3.36x10 1! 038 Yes 6 
64 PV _rs8109288 19 16,046,558 19p13.12 TPM4 A/G 13,964 0.029(0.004) 115x101! 0.14 - 3 
PLT — rs8109288 19 16,046,558 G/A 29,014 11.945(1.892) 2.75x10°1° 0.04 - 
65 PLT rs17356664 19 50,432,610 19q13.32 EXOC3L2 C/T 55,487 2.599(0.415) 3.60x101° 0.07 - - 
66 PV rs13042885 20 1,872,706 20p13 SIRPA C/T 21,186 0.008(0.001) 5.56x10 1% 0.56 - 
67 PV rs4812048 20 57,021,165  20q13.32 CTSZ- C/T 20,811 0.008(0.001) 1.30x10-° 0.06 - - 
TUBB1 
68 PLT rsl1034566 22 18,364,276 22q11.21 ARVCF T/C 61469 2.128(0.384) 3.06 x108 0.43 - - 
69 PLT rs6141 3 185,572,959 3q27 THPO|| T/C 39,366 2.467(0.456) 618x10° 0.59 Yes 24,8 
Results are provided for the 68 loci and 84 sentinel SNPs reaching genome-wide significant (P= 5 x 10~®) association with PLT or MPV. Results for stages 1 and 2 of the analysis in Europeans are provided in 
Supplementary Table 2. MPV, mean platelet volume; PLT, platelet count. 
* Alleles are indexed to the forward strand of NCBI build 36. 


+ Effect sizes in In(fl) for MPV and 10°17? for PLT. 
{All P values are based on the inverse-variance weighted meta-analysis model (fixed effects). 


§ TRIM58 identifies the only secondary signal identified in this study, derived from a genome-wide secondary signal discovery effort carried out by conditioning the discovery GWAS on all SNPs reaching 
significance in the stage 1 meta-analysis. The effects (s.e.) and P values reported are obtained in the secondary analysis. The corresponding values in the stage 1 analysis are effect (s.e.) = 2.721 (0.542) and 


P=4.06 x 10’. Further details of this analysis are given in the Supplementary Information. 


|| THPO narrowly misses the level required for nominal significance (P<5 x 10~®) in Europeans, but shows genome-wide significance in Japanese. 
Rep. indicates replication of European stage 1+2 results in non-Europeans (Supplementary Table 3): yes, if association P value is at least in one non-European population <0.0007 (to account for multiple testing 


of 68 loci). 
#Relevant references are indicated. 


controlling these processes in the associated regions. To define a 
univocal rule to study the enrichment of functional relationships in 
associated genes, we made the choice to focus on a set of 54 ‘core’ genes 
selected as either containing the sentinel SNP or mapping to within 10 
kb from an intergenic sentinel SNP (Table 2). This selection strategy is 
designed to obtain unbiased hypotheses producing interpretable 
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Figure 1 | Protein-protein interaction network and gene transcription 
patterns. a, Negative correlation between PLT and MPV ina UK sample. The 
gender-adjusted correlation coefficient r and trend line are shown. b, Protein- 
protein interaction network of platelet loci. For the nodes, genes are represented 
by round symbols, where node colour reflects gene transcript level in 
megakaryocytes on a continuous scale from low (dark green) to high (white). 
Grey-coloured round symbols identify first-order interactors identified in 
Reactome and IntAct. Core genes not connected to the main network are 
omitted. The 34 core genes are identified by a blue perimeter. Yellow perimeters 
identify five additional genes (VWF, PTPN11, PIK3CG, NFE2 and MYB) with 
known roles in haemostasis and megakaryopoiesis and mapping to within the 
association signals at distances greater than 10 kb from the sentinel SNPs. These 
genes, which do not conform to the rule for inclusion into the core gene list, are 
not considered in further analyses presented in Fig. 2c, d and Supplementary 


biological inference for genes near the association signals, but has 
reduced sensitivity for genes that map further from the sentinel SNP. 
For instance VWF, a key regulator of haemostasis, maps to 55 kb from 
the sentinel SNP (Supplementary Fig. 3 and Supplementary Table 4) 
and is therefore not considered as a core gene. We further note that this 
selection strategy does not imply knowledge of the location of causative 
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Fig. 5 and are shown here for illustration purposes only. Network edges were 
obtained from the Reactome (blue) and IntAct-like (red) databases and 
through manual literature curation (black). The network including the 34 core 
genes alone contains 633 nodes and 827 edges; after inclusion of the 5 
additional genes, the network (shown here) includes 785 nodes and 1,085 edges. 
The full network, containing gene expression levels and other annotation 
features, is available in Cytoscape” format for download (Supplementary Data 
1). c, d, Time course experiments of gene expression in megakaryocytes and 
erythroblasts. Expression of core genes in log, transformed signal intensities 
(log: SI) during differentiation of the haematopoietic stem cells into 
megakaryocytes (c) or erythroblasts (d), segregated by their trends of 
statistically significant increasing (red), decreasing (blue) or unchanged (grey) 
gene expression. The corresponding gene list for the three classes is given in 
Supplementary Data 1. 
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Table 2 | Summary of functional evidence for core genes 


Sentinel SNP (trait) Core gene (distance in kb)* Phenotypet 
rs17396340 (MPV) KIF1B kinesin family member 1B (0) Variant annotation: sentinel SNP in r = 1 with eQTL for KIF1B 
1$2336384 (PLT) MFN2 mitofusin 2 (0) Variant annotation: sentinel SNP is eQTL for MFN2 
rs10914144 (MPV, PLT) DNM3 dynamin 3 (0) shibire (DNM-like): overproliferation of plasmatocytes in Drosophila (this 
study) 
rs1172130, rs1668871 TMCC2 transmembrane and coiled-coil domain Variant annotation: sentinel SNP in * = 0.928 with eQTL for RIPK5 
(MPV, PLT) family 2 (2,481; 0) 
rs1260326 (PLT) GCKR glucokinase (hexokinase 4) regulator (0) Xab1 (non-core gene): pronounced increase in plasmatocyte and crystal cell 
counts in Drosophila (this study) 
rs649729, rs625132 EHD3 EH-domain-containing 3 (0) ehd3 morpholino-injected embryos had no haematopoietic phenotype in D. 
(MPV, PLT) rerio (this study) 
rs17030845 (PLT) THADA thyroid adenoma associated (0) Zfp36l2 (non-core gene): decreased platelet cell number in mouse 
rs1354034 (MPV, PLT) ARHGEF3 Rho guanine nucleotide exchange factor arhgef3: profound effect on thrombopoiesis and erythropoiesis in D. rerio 
(GEF) 3 (0) (this study) 
182227831, rs17568628 F2R coagulation factor Il (thrombin) receptor Parl (F2R): thrombin activation of platelets attenuated in mouse 
(MPV, PLT) (0; 15,343) 
rs700585, rs4521516 MEF2C myocyte enhancer factor 2C (0) Mef2: severely impaired megakaryopoiesis with reduced platelet count and 
(PLT, MPV) increased platelet volume in mouse 
rs2070729 (PLT) IRF1 interferon regulatory factor 1 (0) Irf1: decreased number of NK lymphocytes in mouse 
rs10076782 (MPV) RNF145 ring finger protein 145 (O) rnf145: ablation of thrombopoiesis and erythropoiesis in D. rerio (this study) 
rs210134 (PLT) BAK1 BCL2-antagonist/killer 1 (116) Bak1: genetic ablation of Bci-x/ in mouse leads to thrombocytopenia by 
reducing platelet lifespan and this is corrected by ablation of Bak1 
rs6993770 (PLT) ZFPM2 zinc finger protein, multitype 2 (0) Zfpm2: peripheral haemorrhage in mouse; ush (ZFPM2): reduction in 
plasmatocytes and crystal cells in Drosophila (this study) 
rs6995402 (PLT) PLEC1 plectin (0) Plec1: impaired leukocyte recruitment to wounds in mouse 
rs10813766 (MPV) DOCK8 dedicator of cytokinesis 8 (0) Dock8: decrease in number of B cells and T cells in mouse; autosomal 
recessive hyper-lgE recurrent infection syndrome (OMIM: 243700) 
rs409801 (PLT) AK3 adenylate kinase 3 (2,699) ak3: ablation of thrombopoiesis and erythropoiesis in D. rerio (this study) 
rs7075195, rs10761731 JMJDIC jumonji domain containing 1C (0) jmjdic: ablation of thrombopoiesis and erythropoiesis in D. rerio (this study) 
(MPV, PLT) 
rs505404, rs17655730 PSMD13 proteasome (prosome, macropain) 26S rpn9 (PSMD13): reduction in plasmatocyte numbers in Drosophila (this 
(PLT, MPV) subunit, non-ATPase, 13 (0); NLRP6 NLR family, study) 
pyrin domain containing 6 (7,856) 
rs4246215 (PLT) FEN1 flap structure-specific endonuclease 1 (0) Variant annotation: sentinel SNP is eQTL for CPSF7; Fads2 (non-core gene): 
abnormal platelet physiology and decreased platelet aggregation in mouse 
rs4938642 (PLT) CBL Cas-Br-M (murine) ecotropic retroviral Acute myeloid leukaemia (OMIM: 165360); Cb/: increased platelet 
transforming sequence (0) numbers, increased thymic CD3 and CD4 expression on T cells in mouse; 


haematopoietic stem/progenitor cells showed enhanced sensitivity to 

cytokines in Cb/-null mice 
rs10876550 (MPV) COPZ1 coatomer protein complex, subunit zeta 1 (6,604) Variant annotation: sentinel SNP is eQTL for GPR84; 

Copz1: iron deficiency in mouse; Nfe2 (non-core): thrombocytopenia in 

mouse; Znf385a (non-core): abnormal platelet morphology in mouse; 

Su(var)205 (non-core CBX5): reduction in plasmatocyte number and 

overproliferation of crystal cells in Drosophila (this study) 


rs3 184504 (PLT) SH2B3 SH2B adaptor protein 3 (0) Lnk (SH2B3): increased megakaryopoiesis and platelet count in mouse; 
increased white blood cell counts and decreased platelet count in mouse; 
rpl6 (non-core): reduced plasmatocyte and crystal cell number in Drosophila 
(this study) 
rs4148441 (PLT) ABCC4 ATP-binding cassette, sub-family C ABCC4 is an active constituent of mediator-storing granules in human 
(CFTR/MRP), member 4 (0) platelets 
rs7317038 (MPV) GRTP1 growth hormone regulated TBC protein 1 (0) Variant annotation: sentinel SNP is eQTL for GRTP1; sentinel SNP in r* = 
0.93 with eQTL for RASA3 
rs3000073 (MPV) BRF1 BRF1 homologue, subunit of RNA polymerase brf: reduction in plasmatocyte cell number in Drosophila (this study) 
Ill transcription initiation factor IIIB (S. cerevisiae) (0) 
rs3809566 (PLT) TPM1 tropomyosin 1 (alpha) (1,115) Variant annotation: sentinel SNP in r? = 1 with eQTL for TPM1; tpma (TPM1): 


total abrogation of thrombopoiesis, but normal erythropoiesis in D. rerio 
(this study) 

rs6065 (PLT) GP1BA glycoprotein Ib (platelet), alpha polypeptide (0) Bernard-Soulier syndrome (OMIM: 231200), benign Mediterranean 
macrothrombocytopenia (OMIM: 153670), pseudo-von Willebrand disease 
(OMIM: 177820); Gp1ba: giant platelets, a low platelet count and increased 
bleeding in mouse; Gp1bat’~ mice show complete inhibition of arterial 
thrombus formation and intermediate platelet numbers 

rs708382 (PLT) FAM171A2 family with sequence similarity 171, Glanzmann thrombasthenia (OMIM: 607759); decreased platelet count, 

member A2 (1,108); /TGA2B integrin, alpha 2b (7,207) abnormal platelet morphology and decreased platelet aggregation in 

mouse; itga2b: severely reduced thrombocyte function in D. rerio 

rs12969657 (MPV) CD226 CD226 molecule (0) Variant annotation: sentinel SNP is eQTL for CD226; leukocyte adhesion 
deficiency (OMIM: 116920); CD226 mediates adhesion of megakaryocytic 
cells to endothelial cells and inhibition of this diminishes megakaryocytic 
cell maturation; Dnam1 (CD226): cytotoxic T cells and NK cells less able to 
lyse tumours in mouse 


rs13042885 (MPV) SIRPA signal-regulatory protein alpha (4,163) Sirpa: mild thrombocytopenia in mouse, decreased proportion of single 
positive T cells, enhanced peritoneal macrophage phagocytosis 
rs4812048 (MPV) CTSZ cathepsin Z (5,468); TUBB1 tubulin, beta 1 (6,539) Autosomal dominant macrothrombocytopenia (OMIM: 613112); Tubb1: 


thrombocytopenia resulting from a defect in generating proplatelets in 
mouse; prolonged bleeding time and attenuated response of platelets to 
thrombin in mouse; sun (non-core ATP5E): reduction of crystal cell numbers 
in Drosophila (this study) 
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Sentinel SNP (trait) Core gene (distance in kb)* 


Phenotypet 


rs1034566 (PLT) 
syndrome (0) 


rs6141 (PLT) THPO thrombopoietin (0) 


ARVCF armadillo repeat gene deleted in velocardiofacial 


Variant annotation: sentinel SNP in = 1 with eQTL for UFDIL 


Essential thrombocythemia (OMIM: 187950); Thpo: decrease in platelet 
number and increase in platelet volume in mouse 


Information is given only for genes with a haematopoietic phenotype. A more extensive annotation of genes within associated intervals is presented in Supplementary Table 4. Information on variants associated 
with gene expression is presented in Supplementary Table 6. No evidence for a haematopoietic effect was associated with the following core genes: rs3811444 (PLT) (TRIM58 tripartite motif-containing 58 (0)); 
rs4305276 (MPV) (ANKMY1 ankyrin repeat and MYND domain containing 1 (0)); rs3792366 (PLT) (PDIAS protein disulphide isomerase family A, member 5 (0)); rs10512627 (MPV) (KALRN kalirin, RhoGEF 
kinase (0)); rs11734132 (MPV) (KIAA0232 (5,628)); rs441460 (PLT) (LRRC16A leucine-rich-repeat containing 16A (0)); rs13300663 (PLT) (RCL1 RNA terminal phosphate cyclase-like 1 (O)); rs3731211 (PLT) 
(CDKNAA cyclin-dependent kinase inhibitor 2A (0)); rs2950390, rs941207 (MPV, PLT) (PTGES3 prostaglandin E synthase 3 (cytosolic) (1,871); BAZ2A bromodomain adjacent to zinc finger domain, 2A (0)); 
rs7961894 (MPV, PLT) (WDR66 WD repeat domain 66 (0)); rs8022206 (PLT) (RAD51L1 RAD51-like 1 (S. cerevisiae) (O)); rs8006385 (PLT) (ITPK1 inositol-tetrakisphosphate 1-kinase (0)); rs2297067, rs944002 
(PLT, MPV) (C1 4orf73 exocyst complex component 3-like 4 (0)); rs8076739, rs559972 (MPV, PLT) (TAOK1 TAO kinase 1 (3,357, 0)); rs1697127 (MPV) (AP2B1 adaptor-related protein complex 2, beta 1 subunit 
(0)); rs11082304 (PLT) (CABLES1 Cdk5 and Abl enzyme substrate 1 (0)); rs8109288 (MPV, PLT) (TPM4 tropomyosin 4 (0)), rs17356664 (PLT) (EXOC3L2 exocyst complex component 3-like 2 (3,301)); 
rs2015599 (MPV) (FAR2 fatty acyl CoA reductase 2 (0)); rs397969 (PLT) (AKAP10 A kinase (PRKA) anchor protein 10 (4,506)); rs1 1789898 (PLT) (BRD3 bromodomain containing 3 (0)). 

* Core genes are defined as either containing a sentinel SNP or as mapping at less than 10 kb from an intergenic SNP. Distance from nearest gene is calculated as the absolute distance between SNP and 


transcription start site of the gene or 3’ end of last exon. 


+ Phenotypes are defined from exhaustive search of the OMIM (Online Mendelian Inheritance in Man) database, published in vitro studies for humans and knockout and knockdown experiments for model 
organisms for both core and non-core genes. r* values are calculated from the HapMap phase 2 CEU panel. Drosophila indicates Drosophila melanogaster. 


variants, which is currently incomplete. A detailed SNP survey showed 
that at 15 loci the sentinel SNPs either encoded, or were in high linkage 
disequilibrium (LD, r= 0.8) with, a non-synonymous variant 
(Supplementary Table 5); another 11 either matched or were in high 
linkage disequilibrium with SNPs associated with expression levels of 
core genes (or cis-eQTLs, Supplementary Table 6), indicating that 
other loci may exert their effect through regulation of gene express- 
ion’. The validation of suggestive causative effects, as well as the iden- 
tification of more complex interactions involving other genomic loci 
(trans eQTLs), will require a more comprehensive discovery in appro- 
priately powered genomic data sets. 

As a first effort to characterize biological connectivity among the 
core genes, we applied canonical pathway analyses (see http:// 
www.ingenuity.com), detecting a highly significant over-representa- 
tion of core genes in relevant biological functions such as haemato- 
logical disease, cancer and cell cycle (Supplementary Table 7). 
Encouraged by these results, we extended this effort to construct a 
comprehensive network of protein-protein interactions incorporat- 
ing the core genes. This effort integrated information from public 
databases (principally Reactome and IntAct) with careful manual 
revision of published evidence and high-throughput gene expression 
data. The resulting network, which includes 633 nodes and 827 edges, 
showed extensive connectivity between the proteins encoded by the 
core genes with an established functional role in megakaryopoiesis 
and platelet formation and those encoded by genes hitherto unknown 
to be implicated in these processes (Fig. 1b). 


Transcriptional patterns of core genes 


We next considered whether this connectivity was also reflected in the 
regulation of core gene transcription, and whether expression patterns 
were unique to megakaryocytes. Despite high levels of correlation in 
gene expression between different blood cell types (median = 0.8; 
median absolute deviation = 0.1)'*, we found that core genes tend to 
have significantly greater expression in megakaryocytes than in the 
other blood cells (P = 7.5 X 10°, Supplementary Fig. 5a). This obser- 
vation is compatible with the notion that ultimate steps in blood cell 
lineage specification are accompanied, or driven, by the emergence of 
increasing numbers of lineage-specific transcripts. To explore this 
assumption, we used genome-wide expression arrays to determine 
changes in global transcript levels during in vitro differentiation of 
umbilical-cord blood-derived haematopoietic stem cells to precursors 
of blood cells. We considered five different time points and two cell types, 
erythroblasts (the precursors of red blood cells) and megakaryocytes. 
Notwithstanding high levels of correlation of gene expression between 
erythroblasts and megakaryocytes", core gene transcripts showed a 
significant increase over time in megakaryocytes (P= 1.510 °) 
but not in erythroblasts (P = 0.77, Fig. 1c, d; see also Supplementary 
Fig. 5b). Taken together, these patterns of core gene expression 
are consistent with a different regulation of their transcription in 


megakaryocytes versus erythroblasts, and with their centrality in 
megakaryopoiesis and platelet formation. This hypothesis is also con- 
sistent with the observation that only 5 of the 68 sentinel SNPs exert a 
significant effect on erythrocyte parameters (HBSIL-MYB, RCLI, 
SH2B3, TRIM58 and TMCC2, Supplementary Table 8). 


Gene silencing in model organisms 


To assess whether core genes are indeed implicated in haematopoiesis, 
we interrogated the function of 15 genes using gene silencing in D. rerio 
and D. melanogaster, and supported empirical data with published 
evidence on knockout models in M. musculus (Table 2 and Sup- 
plementary Table 4). In D. rerio, we applied morpholino constructs 
to silence the expression of six genes (Fig. 2 and Supplementary Fig. 6) 
selected to have >50% homology with the human counterpart and no 
previous evidence of involvement in haematopoiesis. Silencing of four 
genes in D. rerio (arhgef3, ak3, rnf145, jmjd1c) resulted in the ablation 
of both primitive erythropoiesis and thrombocyte formation. 
Silencing of tpma, the orthologue of TPM1 that is transcribed in 
megakaryocytes but not in other blood cells, abolished the formation 
of thrombocytes but not of erythrocytes. Silencing of ehd3 did not 
yield a haematopoietic phenotype. We also screened D. melanogaster 
RNA interference (RNAi) knockdown lines for quantitative alterations 
in the two most prevalent classes of blood elements: plasmatocytes and 
crystal cells. The repertoire of blood cells in D. melanogaster, consisting 
of about 95% plasmatocytes and 5% crystal cells, is less varied than in 
vertebrates. Transcription factors and signalling pathways regulating 
haematopoiesis have, however, been conserved throughout evolu- 
tion’, making the RNAi knockdown studies a relevant first step 
towards a better understanding of the putative role of these GWAS 
genes in haematopoiesis. Four core-gene D. melanogaster lines (shibire 
(DNM), ush (ZFPM2), rpn9 (PSMD13), Brf (BRF1)), as well as five 
others (sun (ATP5E), CG3704 (XAB1), Su(var)205 (CBX5), dve 
(SATB1) and RpLé (RPL6)), displayed highly reproducible differences 
in the numbers of these two cell types (Table 2 and Supplementary 
Table 4). Despite widespread differences between mammalian and 
insect haematopoietic lineages’*, our findings from D. melanogaster 
provide new and supporting examples of functional conservation in 
the control of blood cell formation in invertebrates and vertebrates'”-’. 


New gene and functional discoveries 

The data from studies in D. rerio by us and in M. musculus by others 
(see Supplementary Table 4) provided proof-of-concept evidence that 
our prioritization strategy is appropriate for selecting novel genes 
controlling thrombopoiesis and megakaryopoiesis, respectively. 
More detailed insights and additional implicated genes will be 
revealed through the systematic silencing of all genes in the associated 
regions. For instance, RNAi knockdown of dve in D. melanogaster 
reduces plasmatocyte numbers and increases the number of crystal 
cells, thus providing supporting evidence that its non-core gene 
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Figure 2 | Functional assessment of novel loci in D. rerio. Gene-specific 
morpholinos were injected into wild-type and Tg(cd41:EGFP) embryos at the 
one cell stage (Supplementary Fig. 6) to assess alterations in erythropoiesis and 
thrombopoiesis. a, Control D. rerio embryo at 72 h post fertilization (h.p.f.); the 
boxed region corresponds to images in the middle panels of b-h. b-h, Left: 
o-dianisidine staining was used to assess the number of mature erythrocytes at 
48 h.p.f.: ehd3 (h) morpholino-injected embryos showed normal haemoglobin 
staining, whereas embryos injected with ak3 (c), rnf145 (d), arhgef3 (e) or 
jmjdic (g) morpholinos showed a decrease in the number of haemoglobin- 
positive cells compared to control embryos (b). Embryos injected with tpma 
morpholinos (f) showed normal numbers of erythrocytes but unusual 
accumulation dorsally in the blood vessels (compatible with cardiomyopathy). 
Middle: haematopoietic stem-cell and thrombocyte development was assessed 
using the transgenic T'g(cd41:EGFP) line at 72 h.p.f. Embryos injected with the 
ehd3 (h) morpholino had a normal number of GEP* cells in the caudal 
haematopoietic tissue and circulation, when compared to control embryos 
(b). However, GFP* cells were absent in ak3 (c), rnf145 (d), arhgef3 (e), tpma 
(f) and jmjd1c (g) morpholino-injected embryos. Right: One-cell-stage 
embryos were injected with the standard control morpholino (b) or gene- 
specific morpholino (c-h) and monitored during development. No gross 
lethality or developmental abnormalities were observed at 72 h.p.f. in gene- 
specific morpholino-injected embryos (c—h) compared with the control 
(b).aand middle and right panels of b-h, lateral view, anterior left; left panels of 
b-e, g, h, ventral view, anterior up; left panel of f, dorsal view, anterior up. The 
genes appear to be nonspecifically expressed during embryogenesis as shown 
by patterns deposited in the ZFIN resource (http://zfin.org). 
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human homologue SATB1 should be prioritized in functional studies. 
However, the results of the knockdown study in D. rerio do not clarify 
at which hierarchical positions in thrombopoiesis and erythropoiesis 
the genes exert their effect, requiring further assessment in con- 
ditional knockout models in M. musculus with lineage-specific regu- 
lation of gene transcription. Nevertheless, our results have already 
allowed novel insights into the genetic control of these processes. 
Signalling cascades initiated by thrombopoietin (THPO) and its 
receptor cMPL via the JAK2/STAT3/5A signalling pathway are key 
regulatory steps initiating changes in gene expression responsible for 
driving forward megakaryocyte differentiation*®. Our study high- 
lights several additional signalling proteins implicating potentially 
important novel regulatory routes. For instance, two genes encoding 
guanine nucleotide exchange factors (DOCK8 and ARHGEF3) were 
identified. Mendelian mutations of the former are causative of the 
hyper-IgE syndrome, but its effect on platelets had not yet been iden- 
tified. The silencing of the latter gene in D. rerio resulted in a profound 
haematopoietic phenotype characterized by a complete ablation of 
both primitive erythropoiesis and thrombocyte formation, demon- 
strating its novel regulatory role in myeloid differentiation. In a parallel 
and in-depth study we demonstrated its novel role in the regulation of 
iron uptake and erythroid cell maturation’'. A second class of genes 
also known to critically control early and late events of megakaryopoi- 
esis are transcription factors. For instance, MYB silencing by 
microRNA 150 determines the definitive commitment of the mega- 
karyocyte-erythroblast precursor to the megakaryocytic lineage’. A 
further 10 core genes identified in this study are implicated in the 
regulation of transcription. Among these, we have demonstrated here 
that silencing of rnf145 and jmjdic in D. rerio severely affects both 
lineages. 

In conclusion, this highly powered study describes a catalogue of 
known and novel genes associated with key haematopoietic processes 
in humans, providing an additional example of GWAS leading to 
biological discoveries. We further showed that for a large proportion 
of these known and new genes, functional support is achieved from 
model organisms and by overlap with genes implicated in inherited 
Mendelian disorders and in human cancers because of acquired muta- 
tions. In-depth functional studies and comparative analyses will be 
necessary to characterize the precise mechanisms by which these 
new genes and variants affect haematopoiesis, megakaryopoiesis and 
platelet formation. Furthermore, we provide extensive new resources, 
most notably a freely accessible knowledge base embedded in the novel 
protein-protein interaction network, with information about the iden- 
tified platelet genes being implicated in Mendelian disorders and 
results from gene-silencing studies in model organisms. We anticipate 
that these resources will help to advance megakaryopoiesis research, to 
address key questions in blood stem-cell biology and to propose new 
targets for the treatment of haematological disorders. Finally, MPV has 
been associated with the risk of myocardial infarction**”*. The contri- 
bution of the new loci to the aetiology of acute myocardial infarction 
events will require assessment in a prospective setting. 


METHODS SUMMARY 


A summary of the methods can be found in Supplementary Information and 
includes detailed information on: study populations; blood biochemistry mea- 
surements; genotyping methods and quality control filters; genome-wide asso- 
ciation and meta-analysis methods; gene prioritization strategies for functional 
assessment and network construction; protein-protein interaction network; in 
vitro differentiation of blood cells; experimental data sets and analytical methods 
for gene expression analysis; zebrafish morpholino knockdown generation; 
assessment of other model organism resources. 
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The circadian molecular clock creates 
epidermal stem cell heterogeneity 


Peggy Janich’, Gloria Pascual’, Anna Merlos-Suarez’, Eduard Batlle*’, Jiirgen Ripperger*, Urs Albrecht*, Hai- Ying M. Cheng®, 


Karl Obrietan®, Luciano Di Croce’? & Salvador Aznar Benitah!? 


Murine epidermal stem cells undergo alternate cycles of dormancy and activation, fuelling tissue renewal. However, only a 
subset of stem cells becomes active during each round of morphogenesis, indicating that stem cells coexist in heterogeneous 
responsive states. Using a circadian-clock reporter-mouse model, here we show that the dormant hair-follicle stem cell 
niche contains coexisting populations of cells at opposite phases of the clock, which are differentially predisposed to 
respond to homeostatic cues. The core clock protein Bmall modulates the expression of stem cell regulatory genes in an 
oscillatory manner, to create populations that are either predisposed, or less prone, to activation. Disrupting this clock 
equilibrium, through deletion of Bmall (also known as Arntl) or Per1/2, resulted in a progressive accumulation or depletion 
of dormant stem cells, respectively. Stem cell arrhythmia also led to premature epidermal ageing, and a reduction in the 
development of squamous tumours. Our results indicate that the circadian clock fine-tunes the temporal behaviour of 
epidermal stem cells, and that its perturbation affects homeostasis and the predisposition to tumorigenesis. 


Epidermal stem cells ensure that skin homeostasis is maintained. 
Murine epidermal stem cells are located either at the permanent portion 
of the hair follicle—termed the bulge—and are exclusively responsible 
for hair cycling’; or at the junction between the epidermis and the hair 
follicle (isthmus), and feed into the epidermis and sebaceous glands*’. 
In addition, a continuous proliferation of basal interfollicular epidermal 
cells ensures daily epidermal maintenance’. 

Bulge stem cells undergo bouts of activation followed by periods of 
dormancy, to establish hair follicle cycling. Robust TGF-8$ and Bmp 
signals act as ‘activation breaks’, rendering bulge cells dormant during 
the resting phase of the hair cycle (telogen)”"''. At the onset of the 
growth phase (anagen), bulge cells respond to Wnt signals by migrating 
into the lower proliferative hair germ region, where they contribute to 
follicle growth'"’*. Subsequently, at mid-anagen, the bulge undergoes a 
second round of activation, which replenishes cells lost at the onset of 
anagen*’. However, the response of bulge stem cells to activating stimuli 
is a heterogeneous process, as only a subset of them become active 
during either stage of activation’*’*. The nature of such niche hetero- 
geneity is currently unknown. Importantly, perturbing the equilibrium 
between the responsive and non-responsive stem cell states causes 
tissue malfunction and increases the risk of carcinogenesis'°”°. 

Here, we analysed the role of the molecular clock in fine-tuning the 
function of epidermal stem cells. The mammalian clock machinery 
anticipates and synchronizes vital functions related to the physio- 
logical circadian needs of the organism*’. The core molecular clock 
is established by a positive limb, composed of heterodimers of the 
transcription factors Clock and Bmal, which drives the rhythmic 
expression of the negative limb proteins, period (Perl-3), crypto- 
chrome (Cry1/2), and Dec1/2 (ref. 21). Accumulated Per and Cry 
proteins in turn translocate to the nucleus and inhibit Bmal/Clock 
transcriptional activity, thereby repressing their own expression and 
marking the beginning of a new cycle. 


Circadian heterogeneity in hair-follicle stem cells 


We monitored the activity of the clock in epidermal stem cells by 
means ofa reporter bacterial artificial chromosome (BAC) transgenic 


mouse, in which the expression of fluorescent venus is under the 
regulation of the full-length promoter of the Perl gene”. Venus 
expression mirrors the endogenous oscillation of the clock in the 
suprachiasmatic nuclei, where the central pacemaker resides, thereby 
establishing its bona fide circadian reporter activity’. We first studied 
the behaviour of the clock in the dorsal skin of Perl-venus mice 
collected between postnatal days (P)19 and P31, when hair follicles 
synchronously transit from the dormant to the growth phase. 

At P19, a stage in which the bulge is predominantly dormant, the 
bulge cells expressing CD34 and the highest levels of «6 integrin («6 
integrin’™S"'/CD34*) contained a continuum of venus-expressing 
populations, which could be subdivided into cells with highest mean 
fluorescence intensity of venus (venus?"**), and those with no venus 
fluorescence (venus®"), as determined by immunohistochemistry 
(Supplementary Fig. 1), and fluorescence-activated cell sorting 
(FACS) (Fig. la, b). At this stage, the bulge contained approximately 
equal numbers of venus?" and venus*™ stem cells (Fig. 1a, b). As 
hair follicles synchronously progressed into anagen (from P20 to 
P31), the proportion of venus?™sht bulge cells steadily increased, 
reaching a proportion of 90% venus?"®" to 10% venus“ at the peak 
of follicle growth (Fig. 1a, b). Conversely, the basal layer of the inter- 
follicular epidermis («6 integrin’™S"/ CD34) showed a homogenous 
pattern of clock activity with a ratio of 95:5 venus?#"t.venusti™ cells, 
irrespective of the day analysed (Supplementary Fig. 2). Heterogeneity 
of the clock phase in bulge cells during the telogen to anagen trans- 
ition was also observed with another independently generated 
reporter line, Perl-GFP, as shown by three-dimensional whole- 
mount GFP fluorescence imaging of tail epidermis (Supplementary 
Fig. 3a, b)”*. 

We next verified whether these clock stem cells states showed 
circadian rhythmicity. FACS analysis indicated that the expression 
of venus in bulge stem cells in anagen (P27), or interfolllicular 
epidermis basal cells, followed a circadian pattern, irrespective of 
whether the mice were maintained in 12h light/dark cycles, or in 
constant darkness under free running conditions (Fig. 1c and Sup- 
plementary Fig. 4a, b)”’. This circadian variation was further confirmed 
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Figure 1 | The molecular clock regulates the expression of the bulge stem 
cell signature. a, b, Percentages of venus” ®" and venus“ cells in the bulge by 
FACS (n = 6). c, Venus mean fluorescence intensity of bulge cells in mice kept 
under 12h light/12h dark (LD) and 12h dark/12h dark (DD) conditions 


(n = 2). d, Heatmap of selected genes from arrays of venus”™®*' and venus“’” 


by time-lapse in vivo confocal microscopy of venus fluorescence in 
dorsal skin explants biopsied from adult Perl-venus mice, and GFP 
fluorescence in whole mounts of tail epidermis of Perl-GFP mice 
(Supplementary Fig. 4c, d). 


The clock regulates stem cell genes 


We then compared the global transcriptomes of purified venus 
and venus™ bulge cells from the dorsal skin of P18-19 mice («6°"8"*/ 
CD34* /venus?""t and «6?"8"'/CD34* /venus“™). As expected, both 
populations showed differential expression of core circadian tran- 
scripts, such as Cry2, Perl, Nr1d1, Rorf, Dec2 and E4BP4 (also known 
as Nfil3) (Fig. 1d and Supplementary Table 1). Intriguingly, although 
the bulge is inactive at P18-19, both populations differed in the 
expression of a significant number of genes previously shown to con- 
stitute the bulge signature’*"?*4, Venus"? bulge stem cells 
expressed higher levels (between 1.4- to 3-fold) of Wnt-signalling 
factors, including Tcf3, Fzd2/3, Sox9, Lhx2, Lgr5, Lefl, Dkk3 and 
Dab2, as well as TGF-B-inhibitory factors such as Smad7, Ltbp2-4, 
Smurf, Lefty and Cull (Fig. 1d, e, and Supplementary Table 1). Other 
pathways differentially expressed relevant for bulge behaviour related 
to integrins, Notch, Bmp and Shh, among others (Fig. 1d, e). 

Thus, the coexisting clock states of dormant bulge stem cells cor- 
related with differential expression of key epidermal homeostasis 
genes. Promoter analysis revealed that several of these genes, encod- 
ing for proteins known to control bulge dormancy, activation, differ- 
entiation and niche interactions’, contained several putative Bmal1/ 
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bulge cells of P19 mice (n = 3). e, Validation of microarray data by real-time 
PCR. Fold change is shown as relative to venus“ cells after normalization to 
pumilio 1 (Pum1) (n = 2, pool of 6 mice per replicate). Results in b, c and e are 
shown as mean = s.e.m., *P < 0.05, **P < 0.01, ***P < 0.001 (two-tailed 
Student’s t-test). 


Clock-binding sites within their proximal and distal promoter regions 
(Supplementary Table 2). These included the Wnt signalling factors 
Dab2, Lef1, Dkk3, Fzd2, Sox9, Lhx2 and Tcf4; TGF-B regulators such 
as Smad7, Lefty, Smurf2 and Smad9; and Itga6 as well as modulators of 
Bmp and Notch signalling (Supplementary Table 2). We confirmed 
by chromatin immunoprecipitation (ChIP) that Bmal1/Clock bound 
to these promoters in intact adult tail epidermis (Supplementary 
Fig. 5a), and that the binding of Bmall was circadian (Fig. 2). 
Chromatin occupancy of Bmall to these genes was also confirmed 
in FACS-sorted bulge stem cells (Supplementary Fig. 5b). 


Clock arrhythmia affects homeostasis 


We next sought to study the biological significance of this clock 
mechanism in epidermal stem cells in vivo. To this end, we generated 
mice with a conditional deletion of Bmall in the keratin-14* basal 
keratinocyte compartment (K14Cre/ Bmal1!°*?/“°*” | hereafter refered 
as BmallKO) (Supplementary Fig. 6a, b). Deletion of Bmall1 causes 
circadian arrhythmicity without the need to perturb any other core 
circadian member”, and its ubiquitous deletion causes premature 
ageing, including defects in adult skin morphogenesis”-*’. We crossed 
BmallKO and Perl-venus mice, and verified that the circadian clock 
of bulge stem cells and basal interfollicular epidermal cells was 
arrhythmic, and permanently skewed towards a clock’ (venus“”) 
state (Supplementary Fig. 6c). 

Bulge stem cells and primary keratinocytes isolated from the dorsal 
skin of Bmall KO mice expressed lower transcript levels of Wnt-related 
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Figure 2 | Circadian binding of Bmal to the promoters of genes involved in 
adhesion, cell cycle, TGF-B and Wnt pathways. ChIP from tail epidermis of 
wild-type mice collected every 3 h during 24h (white and black bars represent 
day and night, respectively). Graph shows percentage of immunoprecipitated 
DNA over an input control from one representative experiment (” = 2). 


genes including Dab2, Dkk3, Lef1 and Wnt10a, than control stem cells 
(Fig. 3a and Supplementary Fig. 5c). In addition, they contained lower 
messenger RNA levels of TGF-B inhibitors, and higher amounts of 
Tefbr2 and Smad3 (Fig. 3a and Supplementary Fig. 5c). The expression 
of Wnt and TGF-B-related factors in purified bulge stem cells varied 
within a 12h period in wild-type mice, but not BmallKO mice 
(Supplementary Fig. 5d). Accordingly, the hair follicles of BmallKO 
mice showed the same differences at the protein level, as exemplified by 
immunohistochemical analysis of Sox9, Lefl, phospho-Smad2 and 
TGFBR2 in dorsal skin sections, and western blot from primary 
keratinocytes (Fig. 3b, d). 

Altogether, these results indicated that the clock machinery might 
endow subpopulations of epidermal stem cells with different predis- 
positions to respond to dormancy or activation stimuli, such as 
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Figure 3 | Bmall modulates the response of bulge stem cells to activation 
and dormancy cues. a, Differential expression of genes in bulge cells of 
BmallWT and Bmall1KO mice. Fold-change values are shown as relative 
expression to BmallWT cells after normalization to Pum! (n = 2). b, Western 
blot analysis for phospho-Smad2 (p-Smad2), Lef1 and Sox9 in tail epidermis of 
9-month-old Bmall WT and Bmal1KO mice; n = 3 mice were analysed for each 
group; c, Primary mouse keratinocytes of BmallKO show enhanced 
responsiveness to TGF-f1 after 48 h of treatment (n = 3). d, Immunostaining 
for phospho-Smad2, TGFBR2, Lefl and Sox9 in back skin of BmallWT and 
BmallKO mice (n = 5). Scale bars 25 jum. e, Clonogenic assay of FACS-purified 
venus?"®* and venus“ bulge and interfollicular epidermis (IFE) keratinocytes 
from the back skin of P19 Perl-venus mice (7 X 10° bulge and 1 x 10° 
epidermal cells). Results ina and care shown as mean = s.e.m., *P < 0.05 (two- 
tailed Student’s t-test). 


Fzd2  Dkk3 = Sox9 Smad7 Lefty 


Results are shown as mean + s.e.m., **P < 0.01, ***P < 0.001 (one-way 
ANOVA). Statistical significance determined by Cosinor analysis revealed a P 
value of P < 0.05 for Dbp and Cdk4, and P < 0.001 for Itga6, Lefl, Smad7, Sox9 
and Tcf4. 


TGF-B and Wnt. In line with this, the hair follicle bulge and inter- 
follicular epidermis of Bmall KO mice contained higher levels of active 
phospho-Smad2 (Fig. 3b, d). Moreover, BmallKO keratinocytes were 
more responsive to TGF-f treatment than control keratinocytes 
(Fig. 3c). We could not detect any Wnt activity using the Wnt-specific 
reporter containing binding sites for TCF/Lef proteins (TOP-Flash) in 
our primary cultures of mouse keratinocytes stimulated with the 
GSK3b inhibitor 6-bromoindirubin-3-oxime (BIO), or purified 
Wnit3a, in accordance with previous reports”; this prevented us from 
further studying the effect of Bmal1 deletion on Wnt responsiveness 
(Supplementary Fig. 7). 

When plated at clonal density, Perl—-venus bulge stem cells 
and basal epidermal cells Showed a higher growth potential than 
the corresponding venus“ population, further suggesting that the 
clock" state is more prone to become activated than the clock'°” 
counterpart (Fig. 3e). Additional data confirmed this hypothesis. 
First, the hair follicle bulges of Bmall KO mice, which are permanently 
locked in the clock!°” state, contained fewer proliferative cells and a 
higher number of long-term label (BrdU)-retaining dormant stem 
cells (LRCs), from 10 months of age and onwards (Fig. 4a). Second, 
the epidermis of another model of circadian arrhythmia, Perl and 
Per2 double-mutant mice (Perl1/2dKO), which lacks the negative limb 
of the molecular clock and is therefore locked in the clock"®" state, 
showed the opposite effects; that is, enhanced bulge proliferation, 
reduced numbers of bulge LRCs and sustained expression of epidermal 
clock target genes” (Fig. 4b and Supplementary Fig. 8). The hair 
follicles of BmallKO were less efficient in becoming active upon 
depilation (Supplementary Fig. 9a). Bmal1 KO bulge cells were also less 
hyperprolifertive than wild-type bulge cells in response to treatment 
with the phorbol ester 12-O-tetradecanoylphorbol-13-acetate (TPA), 
thus delaying the entry of the hair follicles into anagen (Supplementary 
Fig. 9b). Lastly, epidermal stem cells purified from Bmal1 KO and Per1/ 
2dKO mice were less and more clonogenic, respectively, in vitro than 
wild-type cells (Supplementary Fig. 10a, b). Of note, deletion of Bmall 
or Per1/2 did not affect the proportion of bulge stem cells in adulthood 
compared to their controls (Supplementary Fig. 11a, b). 


bright 


Loss of Bmall induces epidermal ageing 


Reduced and enhanced proliferation, respectively, was also evident in 
the basal layer of the interfollicular epidermis of BmallKO and Per1/ 
2dKO mice (Supplementary Fig. 12). Bmall KO mice showed signs of 
inefficient epidermal self-renewal, with premature signs of ageing as 
early as 5 months of age, such as the accumulation of terminally 
differentiated cornified cells (Fig. 4c and Supplementary Fig. 9c). 
This was accompanied by increased expression of p16, which has been 
previously associated with increased epidermal ageing”, but not p19 
or apoptosis (Supplementary Fig. 13). Because bulge stem cells do not 
contribute to epidermal maintenance in steady-state conditions*', we 
sought to understand the molecular mechanisms underlying the defects 
of the interfollicular epidermis of BmallKO mice. We performed 


8 DECEMBER 2011 | VOL 480 | NATURE | 211 


©2011 Macmillan Publishers Limited. All rights reserved 


ARTICLE 


a 
Bmal1WT Bmal1KO 
ay =| | BmaltkKO 
i Bmalt wT 
é 30 : 


B25 
=) 

2 20 
815 
3 10 


i Bmat wT 
NS 


—1 


10-12 months 


[J Bmal1kKO 
Bi Bmal WT 


10, 


Bmal1WT 


8I- 


6|. 


Cornified layer (arbitrary units) 
Bmal1KO 


Figure 4 | Clock perturbation in vivo results in changes in the number of 
dormant bulge stem cells, and premature epidermal ageing. 

a, b, Quantification of BrdU* LRCs and Ki67* cells in the bulge of 10-month- 
old Bmall WT and Bmal1KO mice (a) and 10-week-old wild type (WT) and 
Per1/2dKO mice (b), showing opposite phenotypes (n = 4; 9 follicles per 


microarray analysis of purified basal interfollicular epidermal pro- 
genitors (x6>"84*/CD34"°8), from 10-month-old BmallKO mice and 
their control littermates (Supplementary Table 3). As expected, cells 
from BmallKO mice showed strong differential expression of most of 
the core circadian transcripts, including the microRNA miRNA-122 
(ref. 32), some of which were validated independently by real-time 
quantitative PCR with reverse transcription (RT-qPCR) (Supplemen- 
tary Fig. 14a and Supplementary Table 3). Interestingly, Gene 
Ontology (GO) analysis indicated that the cell cycle, energy and drug 
metabolism, calcium-sensing proteins, the epidermal barrier response 
and chromatin compaction were significantly affected upon deletion of 
Bmall (Supplementary Fig. 14b). Intriguingly, although BmallKO 
mice showed a hyperkeratotic phenotype, the viable epidermal layers 
expressed lower levels of terminal differentiation markers, including 
Flg, Lor, Sprr1, Lce genes and Tgm (Supplementary Fig. 14c and 
Supplementary Table 3). Interestingly, Bmal1KO cells also expressed 
lower amounts of the miRNA-23b/-27b/-24-1 cluster, which targets 
TGFBR2 and Smad proteins* (Supplementary Fig. 14d). The reduced 
levels of epidermal differentiation genes probably reflects the lower 
efficiency of activation of basal interfollicular epidermal cells in 
BmallKO mice, suggesting that the hyperkeratotic phenotype 
developed as a compensatory mechanism to ensure a certain degree 
of epidermal barrier protection. 


Loss of Bmall reduces skin tumorigenesis 


Because perturbation of the clock machinery affects the predisposi- 
tion of certain tissues to carcinogenesis™, we next studied whether 
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mouse were analysed). c, Histological analysis and immunostaining for 
filaggrin and loricrin (red) in 10-12-months-old Bmall WT and BmallKO 
mice. Graph shows thickness quantification of the cornified layer ( = 13). 
Scale bars, 100 jum. Results are shown as mean = s.e.m., *P < 0.05, **P < 0.01, 
***P < ().001 (two-tailed Student’s t-test). NS, not significant. 


epidermal deletion of Bmal1 had any impact on the development of 
cutaneous squamous tumours. To this end, we crossed BmallKO 
mice with a transgenic line expressing oncogenic Sos, an activator 
of Ras, under the regulation of the Krt5 promoter (K5-SOS)**. In an 
EGFR mutant-heterozygous background, K5-SOS mice sponta- 
neously developed squamous tumours, primarily in the tail, with 
100% penetrance, as previously described**. BmallKO/K5-SOS mice 
developed significantly fewer neoplastic lesions at early-, mid- and 
late-stages of carcinoma development than control mice (Fig. 5a and 
Supplementary Fig. 15a). The skin lesions of Bmall KO/K5-SOS mice 
were more differentiated—as determined by increased expression of 
involucrin and loricrin—contained large cornified islands and a 
higher number of apoptotic areas, as compared to control tumours 
(Supplementary Fig. 15b). Control mice had to be killed by two 
months of age, a time at which no BmallKO mice had developed 
the number, or size, of tumours. The growth of cutaneous squamous 
tumours has been shown to depend on Wnt activity in a population of 
CD34* tumour-initiating cells**. However, we could not detect any 
nuclear B-catenin, either in control or in Bmall KO neoplastic lesions, 
suggesting that, in our model, tumour growth did not primarily 
depend on misregulated Wnt signalling (Supplementary Fig. 16a, 
b). We did observe a significant reduction in the percentage of 
o6°"8''/CD34* tumour-initiating cells in BmalKO tumours with 
respect to wild-type tumours (Fig. 5b and Supplementary Fig. 17). 
Notably, whereas wild-type tumours expressed «6 integrin in basal 
and suprabasal cells, which has been previously associated with 
increased malignancy’’, BmallKO tumours only expressed it in basal 
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Figure 5 | Loss of Bmal1 reduces the development of squamous tumours. 
a, Quantification of tumour weight of BmallWT/K5-SOS and Bmall1KO/K5- 
SOS littermates (n = 5). b, FACS analysis and quantification of «6°"8"'/CD34* 
tumour-initiating cells (P36 mice). Numbers represent percentage of «6°"2"/ 
CD34" cells from gated epithelial cells (n = 3) c, Histological analysis and 
immunostaining for «6-integrin (arrows indicate suprabasal expression). Scale 
bars, 50 pum. Results in a and b are shown as mean + s.e.m., *P < 0.05 (two- 
tailed Student’s t-test). 


cells (Fig. 5c). The reduction in the percentage of tumour-initiating 
cells, together with the high expression of several tumour suppressors, 
in BmallKO epidermal progenitors, including the non-coding RNA 
H19 (top upregulated gene; Supplementary Table 3), probably con- 
tribute to the reduced burden of squamous lesions in Bmall KO mice. 


Discussion 


Our results indicate that the molecular clock establishes stem cell 
states that are differentially predisposed to respond to activation 
and dormancy stimuli. The clock machinery controls the expression 
of essential epidermal stem cell regulators to anticipate the require- 
ments of the tissue. In this sense, in the epidermal compartment 
where quiescent stem cells exist (that is, the bulge), the clock estab- 
lishes a population of ‘ready-to-go’ cells that can rapidly and effi- 
ciently respond to activation stimuli, while simultaneously 
preventing all stem cells within the niche from becoming responsive. 
Future studies will be necessary to determine when, and how, this 
stem cell heterogeneity is established. On the other hand, the murine 
interfollicular epidermis, which primarily depends on continuously 
cycling basal progenitors rather than dormant stem cells** for its 
renewal, shows a much more homogenous clock activity (albeit con- 
taining approximately 5% of cells antiphasic with the majority). In 
this compartment the clock machinery might predominantly estab- 
lish a correct timing of stem cell activation and differentiation. 
Conditional deletion of Bmall in liver, retina and pancreas results 
in profound defects in tissue function®”*', and haematopoietic stem 
cells show a Bmal1/Clock-dependent circadian release to the periphery”. 
Altogether, these findings indicate that the clock machinery may con- 
stitute a fine-tuning homeostatic mechanism in tissues in which 
dormant and active populations of stem cells coexist. Unbalancing 
the epidermal stem cell clock not only substantially affected long- 
term tissue homeostasis, but also the predisposition of the tissue to 
undergo neoplastic transformation. It is likely that perturbations of 
this clock-controlled mechanism over stem cell regulation in humans 
may have long-term consequences on tissue homeostasis, ageing and 
carcinogenesis. 


METHODS SUMMARY 


For the isolation of epidermal cells from back or tail skin, the skin was incubated 
in 0.25% trypsin for 2h at 37°C, or overnight at 4°C, to separate the dermis 
from the epidermis. Back and tail keratinocytes were extracted as described 
previously*. 

For ChIP assays, cells in suspension were cross-linked for 10 min at room 
temperature (22-25°C) in 1% formaldehyde. Cross-linking reactions were 
stopped by adding 1.25 M glycine to a final concentration of 125 mM. Cells were 
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centrifuged at 300g for 10 min at 4 °C and washed in cold PBS. Cell lysis, sonifica- 
tion and ChIP assays were performed using the MAGnify Chromatin 
Immunoprecipitation System (Invitrogen). 

For FACS analysis or sorting of bulge and epidermal stem cells, cell suspen- 
sions were incubated for 30min on ice with PE-conjugated anti-«6-integrin 
(CD49f clone NKI-GoH3, Serotec) and biotin-conjugated anti-CD34 (clone 
RAM34, BD Pharmingen) antibodies followed by APC-conjugated streptavidin 
(BD Pharmingen) for 20 min. Dead cells were excluded by 4',6-diamidino-2- 
phenylindole (DAPI) incorporation. FACS analysis and sorting were performed 
using LSRII FACS Analysers, FACSArialIl, FACSDiva (BD Biosciences) and 
Flowjo software. 

For microarray analysis, total RNA was isolated from FACS-sorted cells using 
Trizol extraction and RNeasy Micro Kit (Qiagen). Transcriptional profiling was 
performed using GeneChip Mouse Gene 1.0 ST Array (Affymetrix) and func- 
tional analysis was performed using DAVID Bioinformatics Resources 6.7. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Animals. Bmal1'®*”°*? and K14Cre mice purchased from The Jackson 
Laboratory were crossed with each other to obtain K14Cre/Bmall””™ 
(Bmall WT) and K14Cre/Bmal1’°*?/“°" (Bmal1 KO), and with K5-SOS mice to 
generate K14Cre/Bmall“”'/K5-SOS (BmallWT/K5SOS) and K14Cre/ 
Bmali'*"'*"/K5-SOS (BmallKO/K5SOS) littermate controls**. Perl-GFP 
and Perl-venus mice have been described previously??”*. K14Cre/Bmal1””' 
Perl-venus and K14Cre/Bmal1’*"""°*? Perl-venus mice were generated by 
crossing K14Cre/Bmall“”™ and K14Cre/Bmal1'°*?/**? animals with Perl- 
venus mice, respectively. Perl and Per2 double-mutant mice (Per1/2dKO) have 
been described previously, and were compared to age and sex-matched control 
animals to avoid differences in hair cycle behaviour between males and females”. 
Mice were housed in an AAALAC-I approved animal unit under 12h light/12h 
dark or 12h dark/12 h dark cycles, and SPF conditions, and all procedures were 
approved by the CEEA (Ethical Committee for Animal Experimentation) of the 
Government of Catalonia. For experiments in constant darkness, light was turned 
off at Zeitgeber time (ZT)12 and animals were housed in 12h dark/12h dark 
conditions for 5 days. For 5-bromo-2-deoxyuridine (BrdU)-labelling experi- 
ments, 100 gg * BrdU (Invitrogen) was injected intraperitoneally into the mice 
and chased for 10 weeks or up to 10 months, as indicated. To activate epidermal 
proliferation, back skin or tail skin of 10-week-old Bmall WT and Bmal1 KO mice 
was treated three times with 20nM TPA (Sigma-Aldrich) during one week. For 
depilation experiments, dorsal skin of 15-month-old BmallWT and BmallKO 
mice in telogen phase was depilated with cold wax strips (Taky). 

Primary mouse keratinocyte cultures. Primary mouse keratinocytes from 
newborn mice or tail skin of adult mice were isolated as described previously*™*. 
Cells were plated in EMEM (Lonza) containing 4% chelated FBS, 1% penicillin/ 
streptomycin and 20 nM calcium for 24h; medium was then changed to growth 
medium (EMEM with 4% chelated FBS, 1% penicillin/streptomycin, EGF (10 ng 
ml—') and 50nM calcium). For the time-course experiments of keratinocytes 
isolated from BmallWT or BmallKO mice, cells were synchronized by a serum 
shock with growth medium, containing 50% chelated FBS, for 2 h*. For TGF-B 
treatments, keratinocytes were cultured in the presence of TGF-B1 (2ngml |; 
PreproTech) for 48h. For clonogenic assays, the indicated number of FACS- 
sorted keratinocytes were plated in E-medium on mitomycin (Sigma-Aldrich)- 
treated J2 3T3 feeder cells**. Keratinocytes were grown at 37 °C with 5% COs. 
Whole-mount immunofluorescence. Preparation of tail skin and whole-mount 
stainings were performed as previously described*’. Primary and secondary anti- 
bodies were incubated overnight and used at the following concentrations: 
1:1,000 for anti-GFP (A11122, Invitrogen); 1:250 for anti-BrdU (Serotec) and 
anti-Ki67 (ab15580, Abcam; clone B56, BD Pharmingen); and 1:500 for anti- 
rabbit and anti-rat conjugated to AlexaFluor488 or AlexaFluor594 (Molecular 
Probes). Nuclei were stained with DAPI (1:5000; Roche), and epidermal sheets 
were mounted in Mowiol. Pictures were acquired with a Leica TCS SP5 confocal 
microscope. 

Immunohistochemistry. Back skin and tail skin were either embedded in OCT, 
or fixed in 4% NBF (Sigma-Aldrich) at 4 °C overnight or 2 h at room temperature 
(22-25 °C) and then embedded in paraffin or OCT. Deparaffinized sections were 
boiled for 10 min in 0.01 M citric acid for antigen retrieval. For permeabilization, 
8-11m slices of either cryosections, or sections that had been deparaffinized or 
antigen retrieved were permeabilized for 25 min in 0.25% Triton X-100/PBS and 
blocked for 90 min in 0.25% gelatin/PBS. Primary antibodies were incubated 
overnight at 4°C, and secondary antibodies were incubated for 2h at room 
temperature in 0.25% gelatin/PBS. Nuclei were stained with DAPI (1:5,000, 
Roche), and the slides were mounted in Mowiol. Primary antibodies were used 
at the following dilutions: 1:50 for anti-p19 (M-167, Santa Cruz), anti-p16 
(M-156, Santa Cruz); 1:100 for anti-Sox9 (H-90, Santa Cruz); 1:200 for anti- 
a6-integrin (CD49f clone NKI-GoH3, Serotec), anti-keratin 14 (ab7800, 
Abcam); 1:250 for anti-Ki67 (ab15580, Abcam); 1:500 for anti-keratin 5 
(ab24647, Abcam); and 1:1,000 for anti-GFP (ab290, Abcam; A11122, 
Invitrogen), anti-involucrin (PRB-140C, Covance), anti-filaggrin (ab24584, 
Abcam), anti-loricrin (PRB-145P, Covance), anti-phospho-Smad2 (#3101, Cell 
Signaling), anti-Lefl (clone C12A5, Cell Signaling). Secondary antibodies were 
used at dilutions of 1:500: anti-rabbit, anti-mouse, or anti-rat, conjugated to 
AlexaFluor488 or AlexaFluor594 (Molecular Probes), or anti-rabbit biotin (GE 
Healthcare), followed by an incubation with ABC Kit (Vector Laboratories) and 
FAST 3,3 Diaminobenzidine tablets (Sigma-Aldrich). For apoptosis measure- 
ments, tail skin was stained with the DeadEnd Fluorometric TUNEL System 
(Promega). Staining with haematoxilin or eosin was done according to a standard 
protocol. Pictures were acquired with a Leica DMI 6000B or a Leica TCS SP5 
confocal microscope. 

Time-lapse microscopy and quantification. For time-lapse confocal imaging, 
back skin of Perl-venus mice was fixed with 0.5% agarose in an imaging dish 
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(Ibidi) and over-layered with E-medium. Images were taken every 15 min for a 
period of 48h using a Leica TCS SP5 confocal microscope equipped with a 
tempered chamber of 37°C and 5% CO . Mean fluorescence intensity of indi- 
vidual GFP-positive nuclei was quantified using ImageJ software. 

FACS. Epidermal cells from back skin of Perl-venus, BmallWT and BmallKO 
mice, or tail skin of Bmall WT/K5-SOS and Bmal1KO/K5-SOS mice, were isolated 
as described previously**. Perl—venus mice were killed between ZT10-ZT12 or as 
specified in the figure legends. Cell suspensions were incubated for 30 min on ice 
with the following antibodies at the given dilutions: 1:100 for biotin- or APC- 
conjugated anti-CD34 (clone RAM34, BD Pharmingen), PE-conjugated anti- 
CD31 (clone MEC13.3, eBioscience), PE-conjugated anti-CD45 (clone 30F11, 
eBioscience), PE-conjugated anti-CD140a (clone APA5, BD Pharmingen) and 
APC-Cy7-conjugated anti-epcam (clone G8.8, Biolegend); 1:200 for PE- or 
FITC-conjugated anti-«.6-integrin (CD49f clone NKI-GoH3, Serotec); and 1:500 
for APC-conjugated streptavidin (BD Pharmingen). DAPI staining was used to 
rule out that dead cells were present. FACS was performed using FACSArialI and 
FACSDiva software (BD Biosciences). Sorted cells were collected in E-medium, 
either plated in culture or re-suspended in Trizol (Invitrogen), and subjected 
further to RNA isolation. FACS analysis was performed using LSRII FACS 
Analysers (BD Biosciences) and Flowjo software. 

Arrays. Total RNA was isolated from FACS-sorted cells in Trizol by chloroform 
extraction, followed by the RNA clean-up protocol of RNeasy Micro or Mini Kit 
(Qiagen). Transcriptional profiling was performed using GeneChip Mouse Gene 
1.0 ST Array (Affymetrix). Arrays of venus?"®"' and venus*™ bulge cells of P19 
Perl-venus mice were performed as triplicates from a pool of n= 64 mice. 
Arrays of epidermal cells from 10-months-old BmallWT and BmallKO mice 
were performed as triplicates using three independent mice in each group. 
Functional analysis of microarray data was performed using DAVID 
Bioinformatics Resources 6.7. 

RT-qPCR. Total RNA from cultured or FACS-sorted cells was purified as 
described above, or using either the RNeasy Micro and Mini Kit (Qiagen) or 
miRVana miRNA isolation Kit (Ambion). Equal amounts of RNA were 
reverse-transcribed using Superscript III (Invitrogen). RT-qPCR was performed 
with SYBR Green Master Mix (Roche) and gene-specific primers (as given in 
Supplementary Table 4)** using a Light Cycler 480 Instrument (Roche). Relative 
levels of expression were determined by normalization to pumilio 1 (Pum1), 
using the AAC, method. 

ChIP. For ChIP assays from intact epidermis, mice were killed at ZT2, or for 
time-course analysis every 3 h during a period of 24h, and tails were incubated in 
0.25% trypsin for 2h at 37°C to separate the dermis from the epidermis. Tail 
keratinocytes were extracted as described’’. Cells in suspension or FACS-purified 
cells (2X 10°) were cross-linked for 10min at room temperature in 1% 
formaldehyde. Cross-linking reactions were stopped by adding 1.25 M glycine 
to a final concentration of 125 mM. Cells were centrifuged at 300g for 10 min at 
4°C and washed in cold PBS. Cell lysis, sonification and ChIP assays were per- 
formed using the MAGnify Chromatin Immunoprecipitation System 
(Invitrogen). For each immunoprecipitation, 2 X 10°-10° cells were incubated 
with 2 ul of anti-Bmall or anti-Clock antibody (provided by J. Ripperger, 
University of Fribourg)”, or 2 ul of rabbit IgG control antibody (Invitrogen). 
RT-qPCR was performed as described above using gene-specific primers 
(Supplementary Table 5). 

Genotyping and western blots. Genomic DNA of primary mouse keratinocytes 
from BmallWT and BmallKO mice was isolated using a standard protocol. 
Multiplex PCR was performed as previously described to determine recombina- 
tion efficiency”. Protein extracts from newborn keratinocytes or from tail 
epidermis were analysed by SDS-PAGE and western blotting for Bmal1 (ref. 49), 
anti-phospho-Smad2 (1:1,000, #3101, Cell Signaling), anti-Lefl (1:1,000, clone 
C12A5, Cell Signaling), anti-Sox9 (1:400, H-90, Santa Cruz) and anti-tubulin 
(1:5,000, Sigma Aldrich) 

Luciferase assay. Primary mouse keratinocytes were transiently transfected with 
either FOP-Flash or TOP-Flash and pCMV-Renilla plasmids using FuGENE6 
(Roche) according to the manufacturer’s instructions. The TOP-Flash construct 
reports Wnt activity by driving the expression of Luciferase under the regulation 
of TCF-Lef binding sites. FOP-Flash contains the same binding sites mutated so 
that they are no longer responsive to TCF/Lef. Cells were treated in the absence or 
presence of 1 tM BIO (Calbiochem) at 48 h post-transfection for a period of 24h. 
Luciferase activity was measured in a Centro LB 960 luminometer (Berthold 
Technologies) using Dual-Luciferase Reporter Assay System (Promega). 
Promoter analysis. Gene promoter analysis for potential Bmal1/Clock-binding 
sites (in general, from —5,000 to + 1,000 bases from the transcriptional start site) 
were analysed using Genomatix Software. 

Statistics. Results are presented as mean + s.e.m. Statistical significance was 
determined by two-tailed Student’s t-test, one-way ANOVA, two-way ANOVA 
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with Bonferroni post-test, or Cosinor analysis. A P value of P= 0.05 was con- 
sidered to be statistically significant. 
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Two ten-billion-solar-mass black holes at the 
centres of giant elliptical galaxies 


Nicholas J. McConnell’, Chung-Pei Ma’, Karl Gebhardt”, Shelley A. Wright', Jeremy D. Murphy’, Tod R. Lauer’, James R. Graham!"* 


& Douglas O. Richstone” 


Observational work conducted over the past few decades indicates 
that all massive galaxies have supermassive black holes at their 
centres. Although the luminosities and brightness fluctuations of 
quasars in the early Universe suggest that some were powered by 
black holes with masses greater than 10 billion solar masses’”, the 
remnants of these objects have not been found in the nearby 
Universe. The giant elliptical galaxy Messier 87 hosts the hitherto 
most massive known black hole, which has a mass of 6.3 billion 
solar masses**. Here we report that NGC 3842, the brightest galaxy 
in a cluster at a distance from Earth of 98 megaparsecs, has a central 
black hole with a mass of 9.7 billion solar masses, and that a black 
hole of comparable or greater mass is present in NGC 4889, 
the brightest galaxy in the Coma cluster (at a distance of 103 
megaparsecs). These two black holes are significantly more massive 
than predicted by linearly extrapolating the widely used correla- 
tions between black-hole mass and the stellar velocity dispersion or 
bulge luminosity of the host galaxy’’. Although these correlations 
remain useful for predicting black-hole masses in less massive 
elliptical galaxies, our measurements suggest that different evolu- 
tionary processes influence the growth of the largest galaxies and 
their black holes. 

Empirical scaling relations between black-hole mass (Mgyy), galaxy 
bulge velocity dispersion (a) and luminosity (L) are commonly used to 
estimate black-hole masses, because for most galaxies we are unable to 
make a direct measurement. Estimates of the number density of black 
holes in a given mass range thus depend upon the empirically deter- 
mined Mpy-o and Mpy-L relations over an appropriate range of 
galaxy masses. Directly measuring Mgy from the kinematics of stars 
or gas in the vicinity of the black hole is particularly difficult at the 
highest galaxy masses, because massive galaxies are rare, their typical 
distances from Earth are large and their central stellar densities are 
relatively low. The most massive galaxies are typically brightest cluster 
galaxies (BCGs), that is, giant elliptical galaxies that reside near the 
centres of galaxy clusters. 

We have obtained high-resolution, two-dimensional data of the 
line-of-sight stellar velocities in the central regions of NGC 3842 and 
NGC 4889 using integral-field spectrographs at the Gemini North and 
Keck 2 telescopes, in Hawaii. The stellar luminosity distribution of 
each galaxy is provided by surface photometry from NASA’s Hubble 
Space Telescope and ground-based telescopes’®"'. NGC 3842 is the 
BCG of Abell 1367, a moderately rich galaxy cluster. NGC 4889 is the 
BCG of the Coma cluster (Abell 1656), one of the richest nearby galaxy 
clusters. We targeted these two galaxies because they have relatively 
high central surface brightnesses and lie at an accessible distance for 
direct measurements of Mgy. 

We measured the distribution of stellar velocities at 82 different 
locations in NGC 3842. The line-of-sight velocity dispersion in NGC 
3842 is between 270 and 300kms at large galactocentric radii (r) 
and rises in the central 0.7 arcsec (r < 330 pc), peaking at 326kms * 


(Figs 1 and 2). We determined the mass of the central black hole by 
constructing a series of orbit superposition models’*. Each model 
assumes a black-hole mass, stellar mass-to-light ratio and dark-matter 
profile, and generates a library of time-averaged stellar orbits in the 
resulting gravitational potential. The model then fits a weighted com- 
bination of orbital line-of-sight velocities to the set of measured stellar 
velocity distributions. The goodness-of-fit statistic 7” is computed as a 
function of the assumed values of Mpy, and the stellar mass-to-light 
ratio. Using our best-fitting model dark-matter halo, we measure a 
black-hole mass of 9.7 X 10° solar masses (Mo), with a 68% con- 
fidence interval of (7.2-12.7) x 10°Mo. Models with no black hole 
are ruled out at the 99.996% confidence level (Ay? = 17.1). We find 
the stellar mass-to-light ratio to equal 5.1Mo/Le in the R band 
(Lo, solar luminosity), with a 68% confidence interval of 4.4- 
5.8Mo/Lo. 

We measured stellar velocity distributions at 63 locations in NGC 
4889 and combined our measurements with published long-slit 
kinematics at larger radii’*. The largest velocity dispersions in NGC 
4889 are located across an extended region on the east side of the 
galaxy. The stellar orbits in our models are defined to be symmetric 
about the galaxy centre, so we constrain Mpy, by running separate trials 
with velocity profiles from four quadrants of the galaxy. The best- 
fitting black-hole masses from the four quadrants range from 
9.8 X 10’Mo to 2.7 X 10'°Mo. All quadrants favour tangential orbits 
near the galaxy centre, which cause the line-of-sight velocity dispersion 
to decrease even as the internal three-dimensional velocity dispersion 
increases towards the black hole. Although no single model is consist- 
ent with all of the observed kinematic features in NGC 4889, we can 
define a confidence interval for Mgy by considering the most extreme 
confidence limits from the cumulative set of models. The correspond- 
ing 68% confidence interval is (0.6-3.7) X 10'°Mo. We adopt a black- 
hole mass of 2.1 X 10'°Mo, corresponding to the midpoint of this 
interval. 

Figure 3 shows the Mgy-o and Mgy-L relations, using data com- 
piled from studies published before the end of August 2011, plus our 
measurements of NGC 3842 and NGC 4889. Tabulated data with 
references are provided as Supplementary Information. The most 
widely used form for both relations is a power law with a constant 
exponent. Straight lines in Fig. 3 show our fits to Mgy(o) and Mpy(L). 
The relationship between o and L, however, flattens at high galaxy 
masses, and constant-exponent power laws for the Mgy-o and Mgy-L 
relations produce contradictory predictions for Mgy in this mass 
range’*. Direct measurements of Mpy in higher mass galaxies will 
compel the revision of one or both of the Mgy—-o and Mgy-L relations. 

The average velocity dispersion in NGC 3842 is 270kms ', 
measured outside the black hole radius of influence (1.2 arcsec or 
570 pc) and inside the two-dimensional half-light radius (38 arcsec 
or 18 kpc). Although NGC 3842 hosts a black hole more massive than 
any previously detected, its average dispersion ranks only fourteenth 
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Figure 1 | Two-dimensional maps of the line-of-sight stellar velocity 
dispersions. The maps show the central regions of NGC 3842 (a) and NGC 
4889 (b) observed using the GMOS spectrograph” on the 8-m Gemini North 
telescope. Additional kinematics at large radii were measured using the VIRUS- 
P spectrograph” on the 2.7-m Harlan J. Smith telescope, and additional high- 
resolution data were acquired with OSIRIS spectrograph” at the 10-m Keck 2 
telescope. GMOS, OSIRIS and VIRUS-P are all integral-field spectrographs, 


among 65 galaxies with direct measurements of Mgy. Its luminosity 
ranks fifth in this sample of galaxies and is exceeded only by other 
BCGs. On the basis of the values of o and L for NGC 3842, our revised 
Mgy-o and Mpy-L relations predict that Mgy = 9.1 X 10°Mo and 
2.5 X 10’Mo, respectively. Similarly, for NGC 4889 the respective 
predictions are Mpyy = 3.3 X 10’Mo and 4.5 X 10’Mo. These predic- 
tions are smaller than our direct measurements of Mpy by 1.6-4.6 
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which record spectra at multiple positions in a two-dimensional spatial array. 
The horizontal dashed line in each panel traces the major axis of the galaxy. The 
median 1-s.d. errors in velocity dispersion are 12kms | and 20kms'! for 
NGC 3842 and NGC 48839, respectively. In NGC 4889, the highest velocity 
dispersions, near 410 km s |, are located on the east side of the galaxy, at least 
1.1 arcsec from the centre. 


times the 1-s.d. scatter in the Mgy-o and Mpgy-L relations’. Four 
measurements of Mgy in BCGs existed before this work. Two mea- 
surements based on gas dynamics and one based on stellar dynamics 
all lie within 1.2s.d. of our revised fits to the Mgy-o and Mgy-L 
relations’*'®. Yet the measurement of Mgy in NGC 1316, the BCG 
of the Fornax cluster, is 3.4 s.d. less than that predicted by our Mpy-L 
relation’’. The high scatter indicated by this collection of measure- 
ments reveals large uncertainties in the standard practice of using 
galactic o or L as a proxy for the central black-hole mass in giant 
elliptical galaxies and their predecessors. 

Several BCGs within 200 Mpc of Earth are at least twice as luminous 
as NGC 3842 and three times as luminous as Messier 87, which hosted 
the most massive black hole known before this work. In spite of their 
extreme luminosities, BCGs have velocity dispersions similar to those 
of the most massive field elliptical galaxies. Yet the most massive black 
holes are found predominantly in BCGs (Fig. 3). How galaxies are 
assembled and the role of gas dissipation affect the correlations (or 
lack thereof) among My, o and L. Simulations of mergers of gas-rich 
disk galaxies are able to produce remnant galaxies that follow the 
observed Mgy-¢ correlation in Fig. 3a over the intermediate mass 
range Mgy ~ 10’Mp-10°Mo (refs 18, 19). By contrast, simulated 
mergers of elliptical galaxies with low-angular-momentum progenitor 
orbits increase Mpy and L by similar numerical factors, without 
increasing the velocity dispersion”. Because these mergers are a likely 
path to forming the most massive galaxies, the Mgy-¢ correlation may 
steepen or disappear altogether at the highest galaxy masses. Massive 
elliptical galaxies retain residual quantities of gas even after the decline 
of star formation. Accretion of this gas onto the galaxies’ central black 


Figure 2 | One-dimensional profiles of the line-of-sight velocity 
dispersions. a, Dispersion versus radius in NGC 3842, after averaging data at a 
given radius, based on measurements with GMOS (black circles) and VIRUS-P 
(red diamonds). The solid blue line is the projected line-of-sight dispersion 
from our best-fitting stellar orbit model of NGC 3842. b, Dispersion versus 
radius along the major axis of NGC 4889, measured using GMOS (black circles) 
and the William Herschel Telescope’* (WHT; green triangles). The maximum 
velocity dispersion occurs at r = 1.4 arcsec. The solid blue line is the projected 
line-of-sight dispersion from our best-fitting orbit model using data from the 
east side of NGC 4889 (r > 0). The dashed orange line is from our best-fitting 
orbit model using data from the west side of NGC 4889 (r < 0). Error bars, 1 s.d. 
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Figure 3 | Correlations of dynamically measured black hole masses and bulk 
properties of host galaxies. a, Black-hole mass (Mgy;) versus stellar velocity 
dispersion (c) for 65 galaxies with direct dynamical measurements of Mgyy. For 
galaxies with spatially resolved stellar kinematics, ¢ is the luminosity-weighted 
average within one effective radius (Supplementary Information). b, Black-hole 
mass versus V-band bulge luminosity, Ly (Lev, solar value), for 36 early-type 
galaxies with direct dynamical measurements of My. Our sample of 65 
galaxies consists of 32 measurements from a 2009 compilation’, 16 galaxies 
with masses updated since 2009, 15 new galaxies with Mpy measurements and 
the two galaxies reported here. A complete list of the galaxies is given in 
Supplementary Table 4. BCGs (defined here as the most luminous galaxy in a 
cluster) are plotted in green, other elliptical and SO galaxies are plotted in red, 
and late-type spiral galaxies are plotted in blue. The black-hole masses are 


holes could help increase Mgy and further steepen the Mgy—-o and 
Mpy-L relations. 

Black holes in excess of 10'°Mo are observed as quasars in the early 
Universe, from 1.4 X 10° to 3.3 X 10’ yr after the Big Bang” (redshift, 
z= 2-4.5). Throughout the last 1.0 x 10'° yr, however, these extremely 
massive black holes have not been accreting appreciably, and the average 
mass of the black holes powering quasars has decreased steadily. Quasar 
activity and elliptical galaxy formation are predicted to arise from 
similar merger-triggered processes, and there is growing evidence that 
present-day massive elliptical galaxies once hosted the most-luminous 
high-redshift quasars*’. However, definitive classification of these 
quasars’ host galaxies has remained elusive. 

Our measurements of black holes with masses of around 10'°Mg in 
NGC 3842 and NGC 4889 provide circumstantial evidence that BCGs 
host the remnants of extremely luminous quasars. The number density 
of nearby BCGs (~5 X 10 ° Mpc *) is consistent with the number 
density of black holes (~3 10-7 to 10-°>Mpc °) with masses 
between 10°Mo and 10'°Mg@ predicted from the Mgy-L relation 
and the luminosity function of nearby galaxies. Furthermore, both 
quantities agree with predictions based on the black-hole masses and 
duty cycles of quasars. The black-hole number density predicted from 
the Mgy-¢ relation, however, is an order of magnitude less than the 
inferred quasar population'*”’. These two predictions can be reconciled 
if the Mgy-o relation has upward curvature or a large degree of intrinsic 
scatter in Mpy at the high-mass end, as suggested by our new measure- 
ments. With improvements in adaptive optics instrumentation on 
large optical telescopes and very-long-baseline interferometry at radio 
wavelengths, black holes are being sought and detected in increasingly 
exotic host galaxies. Along with our measurements of the black-hole 


measured using the dynamics of masers (triangles), stars (stars) or gas (circles). 
Error bars, 68% confidence intervals. For most of the maser galaxies, the error 
bars in Mgy are smaller than the plotted symbol. The solid black line in a shows 
the best-fitting power law for the entire sample: logio(Mpu/Mo) = 

8.29 + 5.12log,o[o/(200 km s~')]. When early-type and late-type galaxies are 
fitted separately, the resulting power laws are logio(Mpu/Mo) = 

8.38 + 4.53logi9[o/(200 km s — )] for elliptical and SO galaxies (dashed red line) 
and logio(Mgu/Mo) = 7.97 + 4.58logio[a/(200kms ')] for spiral galaxies 
(dotted blue line). The solid black line in b shows the best-fitting power law: 
logio(Mpu/Mo) = 9.16 + 1.16logio(Ly /10''Le). We do not label Messier 87 
as a BCG, as is commonly done, because NGC 4472 in the Virgo cluster is 
0.2 mag brighter. 


masses in NGC 3842 and NGC 4889, future measurements in other 
massive galaxies will quantify the cumulative growth of supermassive 
black holes in the Universe’s densest environments. 
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Atomic homodyne detection of continuous- variable 
entangled twin-atom states 


G; Gross', H. Strobel!, E. Nicklas!, T. Zibold', N. Bar-Gill?+, G. Kurizki* & M. K. Oberthaler! 


Historically, the completeness of quantum theory has been questioned 
using the concept of bipartite continuous-variable entanglement’. 
The non-classical correlations (entanglement) between the two sub- 
systems imply that the observables of one subsystem are determined 
by the measurement choice on the other, regardless of the distance 
between the subsystems. Nowadays, continuous-variable entangle- 
ment is regarded as an essential resource, allowing for quantum 
enhanced measurement resolution’, the realization of quantum 
teleportation® > and quantum memories**®, or the demonstration 
of the Einstein-Podolsky-Rosen paradox’’°. These applications 
rely on techniques to manipulate and detect coherences of quantum 
fields, the quadratures. Whereas in optics coherent homodyne 
detection’® of quadratures is a standard technique, for massive 
particles a corresponding method was missing. Here we report the 
realization of an atomic analogue to homodyne detection for the 
measurement of matter-wave quadratures. The application of this 
technique to a quantum state produced by spin-changing collisions 
in a Bose-Einstein condensate’’’? reveals continuous-variable 
entanglement, as well as the twin-atom character of the state’. 
Our results provide a rare example of continuous-variable 
entanglement of massive particles®’*. The direct detection of atomic 
quadratures has applications not only in experimental quantum 
atom optics, but also for the measurement of fields in many-body 
systems of massive particles’’. 

Continuous-variable entangled states, whose inter-mode entangle- 
ment is reflected in quadrature as well as population correlations, 
are routinely generated by parametric down-conversion in 
optical parametric amplifiers (OPAs)'°. Spin-changing collisions in 
ultracold bosonic quantum gases provide a similar nonlinear process 
for matter waves, generating atom pairs with one atom spin-up and the 
other spin-down—that is, twin atoms'®’. Correlated photon or atom 
pairs in the signal |{) and idler | |) modes are created from a pump 
field, via nonlinear interactions described by the Hamiltonian 
ho aja +a ral where a, (a,) is the creation (annihilation) 
operator of the respective mode k and 21h is Planck’s constant. This 
Hamiltonian presumes a large-amplitude coherent pump field, whose 
depletion or distortion by the signal and idler modes is negligible, such 
that it can be treated classically. In optics, the effective nonlinearity 
%q is proportional to the amplitude of this pump field and the suscep- 
tibility of the medium. In the analogous regime for atoms, the effective 
nonlinearity originates from interatomic interactions and is propor- 
tional to the population in the pump mode, that is, the Bose-Einstein 
condensate (BEC). For initially empty signal and idler modes, 
quantum fluctuations are amplified and the output is the so-called 
two-mode squeezed vacuum’’. This state is characterized by the 
vanishing of the mean field amplitude (E,) = 0 in each of the modes, 
while the mean intensity is non-zero (Ik) 0c ELE, >0. The asso- 


ciated two-mode entanglement is revealed by the quadratures 
Xi(9)=a je” tate” +h.c,, the signature being the squeezing of 
their variance for appropriately chosen phase g (h.c. indicates the 
Hermitian conjugate terms). To access the two-mode quadratures, 


the coherent homodyne measurement technique is employed, 
whereby the conjugate canonical quadratures X; =a} +a, and 
Y,= il a, =a) are measured by mixing the quantum mode with a 
strong ‘classical reference field, the local oscillator al o=/Nioe? 
with large amplitude No. Experimental control of the local 
oscillator phase @ provides access to the continuous distribution of 
single mode quadratures X;(~) =a,e” +a,e~” and their two-mode 
counterparts X..(~) (X, and Y; are used as abbreviations for X;(0) and 
X,({1/2)). For the squeezed vacuum state, the quadrature distribu- 
tion of individual modes is isotropic and centred around the origin, 
reflecting the undefined phase. The correlation of the modes leads to 
anisotropic and squeezed distributions of the two-mode quadratures 
X+(@) (refs 10, 20), as seen in Fig. la. 

Here we report the development of atomic homodyne detection and 
apply this technique to measure quadratures of twin-atom matter- 
wave fields generated by controlled spin-changing collisions in a 
BEC. The experiments involve a spinor BEC of *’Rb trapped in a 
one-dimensional optical lattice with few hundred atoms per site and 
a lattice spacing of 5.5 um (Fig. 1b). A high lattice potential ensures 
that tunnelling between sites is negligible on the experimental time- 
scale, such that the condensates in the different sites are independent”. 
The density in the centre of the lattice sites is of the order of 
2x10'*cm~*, resulting in a minimal spin healing length of 
€ ~ 1 um, which is comparable to the extension of the on-site wave- 
function (approximately 1.1,1m full-width at half-maximum, 
FWHM). The spatial degrees of freedom are therefore frozen and 
the dynamics happens exclusively in the hyperfine spin’. The con- 
densate is initially prepared in the (F, mp) = (2, 0) hyperfine state, 
which serves as the analogue to the pump mode in an OPA. Making 
use of a combination of quadratic Zeeman shift and state-dependent 
microwave dressing”*”’, the spin dynamics is energetically restricted to 
a spin-1 subspace defined by the 0, +1 Zeeman states (see Sup- 
plementary Information). We choose a constant evolution time under 
spin-changing collisions of 22 ms in all our experiments. In the regime 
of small pump mode depletion, spin-changing collisions can be 
described by an OPA-type pair coupling between the Zeeman 
states'?'*"°"” with the +1 and —1 states as signal ({) and idler (|) 
modes, respectively. In optics, the integrated nonlinearity is restricted 
by short interaction times, a limit which is surpassed in the atomic 
system, such that continuous-variable entangled states with large 
populations can be realized. As the underlying process generates pairs 
of atoms in the signal and idler modes—twin atoms—the mean and 
variance of the population difference N_ = N, — N; should ideally 
vanish although the total population N,=N,+N;, fluctuates 
strongly. This can be experimentally confirmed by absorption imaging 
of the atomic cloud*' where state selectivity is achieved by Stern- 
Gerlach separation (Fig. 1b, c). 

A quantitative analysis of the variance of the population difference 
A’N_ (where A’ indicates variance) and the distribution of the popu- 
lation sum N. gives a first indication of the quantum state of the 
system (Fig. 2). Owing to the pump mode population dependent 
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Figure 1 | Analogy to optics and measured population correlations of twin- 
atom states. a, Parametric down-conversion with light, and the analogy to 
atomic spin-changing collisions. In a nonlinear medium, effective interactions 
(top) result in the creation of photon pairs in the signal (red) and idler (blue) 
modes from pump mode photons (green). The quadratures of the output 
modes (bottom) are centred around the origin—the individual ones are 
isotropic, while the two-mode quadratures are squeezed reflecting their 
correlations (purple). See text for nomenclature. Pair creation due to spin- 
changing collisions in tight traps is an analogous process in quantum atom 
optics. b, Experimental system and exemplary raw absorption pictures. Left, 
schematic representation (top) of the one-dimensional optical lattice on which 


nonlinearity, the mean atom number in signal and idler mode (N,) 
increases nonlinearly with the total atom number N, resulting in a 
growing fraction (N)/N. In the small N <300 limit, the observed 
distribution of the total population N, matches the prediction for 
the squeezed vacuum (taking detection noise into account by convolu- 
tion with a Gaussian) with maximal squeezing parameter r~ 2 
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is superimposed an optical dipole trap to tightly confine the atoms in all 
directions. In each site, selected Zeeman levels of the hyperfine spin (bottom) 
are the only accessible degrees of freedom. The states involved in the spin- 
changing collision process are highlighted (see text for nomenclature). Right, 
three typical experimental pictures after spin-changing collisions where the 
clear spatial separation of sites and states can be seen. c, Population 
correlations. These correlations are visible directly on the raw data (right) 
where the atom numbers in the signal (red) and idler (blue, numbers shown 
with opposite sign) modes fluctuate strongly while their difference (black) 
shows small noise. Left, a zoom into 25 experimental realizations, clearly 
showing that the fluctuations are common mode. 


(ref. 10). The corresponding mean population is (N.) ~ 25 and the 
data shows a comparably large standard deviation—in contrast, the 
noise in the population difference N_ almost vanishes (AN_ < 1.9 
atoms). It is important to note that the atomic variances reported 
throughout this Letter are corrected for detection noise, as detailed 
in the Supplementary Information. For larger total atom numbers N, 


Figure 2 | Population of the signal and idler modes. Main figure: red data, 
mean population (N+) = (N;+N)) versus total atom number N. Error bars, 
fluctuations (1 s.d.) of the data. The large uncertainty is a feature of the 
nonlinear process itself, which leads to a non-Gaussian distribution of the mode 
population. Grey area, the sub-Poisson regime, the upper bound of which is 
obtained by a spline fit to (N..). Blue data points, variance of the population 
difference A7N_, which is compatible with zero for small N (error bars, 1 s.d. 
statistical uncertainty). Its increase with N is reproduced when taking particle 
loss due to spin relaxation into account (black line). Insets, the distribution of 
the mode population for indicated total atom numbers. For small N, the 
distributions match the prediction for a non-depleted pump with the measured 
mean population (N.,) (black lines) whereas for larger N they clearly differ. The 
fitted squeezed vacuum distribution for 250 < N < 300 corresponds to a 
squeezing parameter of r ~ 2 (ref. 10). 
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the pump depletion can no longer be neglected”; this results in a 
breakdown of the analogy to the OPA picture, and the observed dis- 
tributions of N,. differ from the squeezed vacuum (Fig. 2 insets). In this 
regime, the well controllable spin dynamics offer prospects for the 
deterministic generation of non-gaussian entangled twin-atom 
states”. Density dependent two-body spin relaxation loss causes the 
perfect pair correlations to deteriorate, such that the relative popu- 
lation noise grows with N. Averaging the results for all total atom 
numbers, we find the noise A7N_ suppressed by 6.9 dB below the 
Poisson level (N,,). 

The observable populations do not suffice to characterize the two- 
mode state; we must access coherences between the modes, the two- 
mode quadratures. We use atomic homodyning by analogy to the 
experimental technique used in optics, whereby the quadratures are 
measured relative to a reference field. Yet in the atomic case, the popu- 
lation of available local oscillator (LO) reference fields is often limited 
to rather small atom numbers. This has important consequences for 


(aloo); 


which in general differ from the canonical ones. A measure for this 
difference is given by the ratio of the quantum mode population to the 
local oscillator population, by which the Heisenberg uncertainty rela- 
tion is modified?’, AX, AY; >1— (Ni) / (at oaro : 

For the twin-atom state, we expect peculiar coherence properties: 
the quadratures of the individual modes fluctuate strongly around a 
vanishing mean, while the two-mode quadratures X.(@) show reduced 
fluctuations for certain ¢ reflecting the quantum correlations. To reveal 
these characteristics in the coherences of the generated quantum fields, 
we employ a homodyning scheme such that information about the 
two-mode quadratures X + (¢~) can be obtained. Measurement of these 
observables requires comparison of each mode with a local oscillator. 
We ensure relative local oscillator phase stability by using the pump 
mode (2, 0) as the simultaneous reference, whose phase g can be 
controlled by microwave dressing. An atomic three-port beam splitter 


the observable quadratures X= (a,a.0 + axa) 


LETTER 


is realized by radio-frequency coupling of signal and idler modes to the 
pump. Analysis of the fluctuations in atom number sum and difference 
No,-1) £ Neg,1) between the (2, —1) and (2, 1) Zeeman states after the 
mixing provides, after proper normalization, an upper bound (ub) for 
the variance of the two-mode quadratures A’X"?(g). The measured 
fluctuations include further noise contributions governed by popu- 
lation fluctuations in the signal and idler modes, which are small only 
for the quadrature difference. Further details of the analysis in the three 
mode framework and the choice of the normalizations can be found in 
Supplementary Information. Figure 3a and b shows raw data in the 
regime of small pump depletion of the measured normalized atom 
number difference and sum versus local oscillator phase y. No phase 
dependence is visible in the mean, but nevertheless, the fluctuations are 
strongly modulated, indicating phase correlations between the two 
atomic modes. 

Weset up a different experiment to obtain the quadrature fluctuations 
of the signal and idler modes individually. As local oscillators, we prepare 
an atomic population in the (1, + 1) states before initiating the dynamics. 
Mixing of the local oscillators with the quantum modes is done by a two- 
photon microwave and radio-frequency 1/2 coupling pulse. For both the 
signal and idler mode, we find coupling phase independent fluctuations 
of the atomic quadratures X41) — (No, +1) —Na,z1)/V(Nto) and a 
vanishing mean. The mean local oscillator population (Nyo) of the 
(1, +1) state was measured by omitting the mixing pulse. 

To ensure that our observations are indeed resulting directly from 
the spin-changing collision process, the technical phase noise has to be 
characterized. The one-dimensional optical lattice set-up allows for 
this characterization by a reference measurement in which the popu- 
lation in the (2, +1) states is also prepared by electromagnetic coupling 
and spin-changing collisions are tuned off-resonance during the 
evolution time. The inter-site correlations observed after the mixing 
are shown in Fig. 3c for this reference experiment (top) and for the 
quantum state prepared by spin changing collisions (bottom). In both 
cases, we observe similarly large on-site fluctuations in the atom 
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Figure 3 | Atomic homodyning. a, Homodyne measurement of the 
quadrature difference. After symmetric coupling to the mp = 0 state (inset in 
b), the normalized population difference in the mp = +1 states provides a 
measure for the two-mode quadrature difference. Phase dependent noise is 
visible directly on the raw data while no dependence is observed in the mean 
population difference (solid blue line). b, Homodyne measurement of the 
quadrature sum. An upper bound for the variance of the two-mode quadrature 
sum is obtained from the fluctuations of the normalized population sum after 
the coupling. The fluctuations of the raw data are strongly phase dependent 
while the mean (blue solid line) stays constant. The variance minimum is 
shifted by 1/2 as compared to a, indicating noise suppression in the orthogonal 
two-mode quadrature. In a and b, the data correspond to all measurements 
with total numbers 150 < N < 200, and the red lines indicate the inferred 


Local oscillator phase, g 


standard deviation above and below the mean after subtraction of the readout 
noise. ¢, Classical versus quantum preparation—inter-site correlation 
measurement. Characterization of classical phase noise is crucial to obtain 
information about the individual quadratures of the signal and idler modes. For 
initial states prepared classically, that is by linear coupling, strong inter-site 
correlations between different sites k,l in the optical lattice after mixing with the 
local oscillator (top illustrations) reveal classically washed out coherences 
(middle). In contrast, when the (F = 2, mp = + 1) states are populated by spin- 
changing collisions, no correlations are observed within the statistical 
uncertainty showing that (Xi |)) =0 asa direct result of the spin dynamics 
(bottom). The colour scale is used to represent the observed inter-site 
correlations. 
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Figure 4 | Two-mode quadrature fluctuations and mode inseparability. 

a, Two-mode sum and difference quadratures. The measured variances 

A’ xX‘ (@) (blue) and A*X“>(g) (red)—upper bounds for the sum and 
difference two-mode quadrature variances—are plotted versus local oscillator 
phase for total atom numbers 150 < N < 200. The shown experimental results 
are not corrected for readout noise (in contrast to the values stated throughout 
the text and in b, c) but its magnitude is shown as the black line. b, Continuous- 
variable quadrature entanglement. Inseparability is detected in the grey shaded 


number difference. However, in the reference experiment we can use 
the inter-site correlations to reduce these fluctuations by a factor of 20, 
such that the remaining fluctuations are close to the expected noise 
limit for two coherent modes. This shows that inter-site correlations 
detect finite coherences that have been washed out by classical phase 
noise in our measurements. In the case of spin-changing collisions, 
no correlations are observed within the statistical uncertainty, mean- 
ing that the vanishing coherences are indeed caused by the process 
itself. Given this result, we can be sure that technical phase noise does 
not influence the sum of the orthogonal quadrature variances 
AX, 1) +A? Y4 1) =2A°X1 |) (the measured quadrature variance is 
independent of the local oscillator phase), such that it is a useful 
observable to characterize the quadrature fluctuations. For further 
details on the inter-site correlation measurement, see Supplementary 
Information. 

Inter-mode entanglement of the twin-atom state is reflected in 
suppressed noise in the two-mode quadratures compared to the 
quadrature fluctuations of the individual modes'? A?X_ +A°Y, 
<A’X;+A°X,+A°Y;+A’Y,. When investigating this inequality 
in our experiment, care has to be taken to ensure comparability of 
the single- and two-mode measurements”’. In both measurements, the 
ratio of the sum of the population in the signal and idler modes to the 
reference mode population is below 10% for small N < 200, and grows 
to approximately 55% for the largest N. In Fig. 4a we plot the measured 
upper bound for the two-mode quadrature variances versus local 
oscillator phase A*X%>(g) for an exemplary total atom number. The 
sum of the minima at @ and @ — (n/2) provides an upper bound for 
the sum of the orthogonal two-mode quadrature variances 
> A; =A°X*> + A’ ">, while the measurement of the single-mode 
quadrature variances )> Aj =2(A°X; + A?X |) has been discussed 
above. The generated quantum states fulfil the inequality 
yA} < YA} for a wide range of total atom numbers (Fig. 4b), show- 
ing that the produced twin-atom state is inseparable—the observed 
minimal ratio is )> A} /}> Aj~0.2. 

For the squeezed vacuum, which is theoretically expected in the 
non-depleted pump regime, the two-mode quadrature variances 
should be reduced below the noise level of two coherent modes 
A’X“>(g) =2. For the more precisely detectable variance of the 
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region where )~ A}, the upper bound for the sum of the two-mode quadrature 
variance minima (green), is smaller than 5> Ai, the sum of the variance in the 
individual quadratures (black). Inset, ratio > A} /S> Aj. ¢, Minimum of the 
two-mode difference quadrature fluctuations. For small total atom numbers 
N—in the non-depleted pump regime and where spin relaxation loss is small— 
the minimum two-mode quadrature variance A?X*(g,) inferred from a local 
quadratic fit is at the noise limit for two coherent fields (dashed black line). 
Inset, zoom into the small N region. All error bars, 1-s.d. uncertainties. 


quadrature difference, we find the noise minimum for small total N 
to be comparable to this level, limited by the experimental signal to 
noise ratio. In Fig. 4c we show the dependence of this noise minimum 
on the total atom number N—detection noise subtraction is crucial 
here, and without it we find A?X*%°(~)~17 as the minimal value. 

In conclusion, we have developed an atomic homodyne detection 
method, which allows for the measurement of quadrature correlations 
of twin-atom quantum fields. The continuous-variable entangled 
states have been produced in a deterministic manner using controlled 
atomic spin interactions—for small total atom numbers, the observed 
population distributions are compatible with the atomic two-mode 
squeezed vacuum. Einstein-Podolsky-Rosen (EPR) entanglement is 
revealed by A(X; - X))A*(Y; + Y|) <1, and we find 4 + 17 for this 
value, showing that EPR entanglement for atomic quadratures is 
within reach. In a future extension of our scheme, the local oscillator 
might be split before the mixing, enabling precise detection of the two- 
mode quadrature sum as well’®. Spatial separation of the modes can be 
implemented by employing the Stern-Gerlach technique, as an 
alternative approach to recently reported experiments’’”*. The regime 
of large average pair population is accessible, such that highly non- 
classical quantum states might be generated by controlled quantum 
spin evolution”’. 

We note that in parallel to this work, sub-Poissonian number fluc- 
tuations after spin-changing collisions have been observed by two 
other groups’, and it has been shown that the generated quantum 
correlated states are useful for noise interferometry beyond the standard 
quantum limit*®. 
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Trifluoromethylation of arenes and heteroarenes by 
means of photoredox catalysis 


David A. Nagib' & David W. C. MacMillan! 


Modern drug discovery relies on the continual development of syn- 
thetic methodology to address the many challenges associated with 
the design of new pharmaceutical agents’. One such challenge arises 
from the enzymatic metabolism of drugs in vivo by cytochrome 
P450 oxidases, which use single-electron oxidative mechanisms to 
rapidly modify small molecules to facilitate their excretion’. A com- 
monly used synthetic strategy to protect against in vivo metabolism 
involves the incorporation of electron-withdrawing functionality, 
such as the trifluoromethyl (CF3) group, into drug candidates’. The 
CF; group enjoys a privileged role in the realm of medicinal chem- 
istry because its incorporation into small molecules often enhances 
efficacy by promoting electrostatic interactions with targets, 
improving cellular membrane permeability, and increasing robust- 
ness towards oxidative metabolism of the drug*®. Although com- 
mon pharmacophores often bear CF; motifs in an aromatic system, 
access to such analogues typically requires the incorporation of the 
CF; group, or a surrogate moiety, at the start of a multi-step syn- 
thetic sequence. Here we report a mild, operationally simple strat- 
egy for the direct trifluoromethylation of unactivated arenes and 
heteroarenes through a radical-mediated mechanism using com- 
mercial photocatalysts and a household light bulb. We demonstrate 
the broad utility of this transformation through addition of CF; toa 
number of heteroaromatic and aromatic systems. The benefit to 
medicinal chemistry and applicability to late-stage drug develop- 
ment is also shown through examples of the direct trifluoromethy- 
lation of widely prescribed pharmaceutical agents. 

The CF3 moiety is typically installed by means of cross-coupling 
technologies catalysed by transition metals. These methods historically 
required the stoichiometric use of metal salts or organometallic com- 
plexes, and have shown limited generality’. Recently, however, copper 
and palladium complexes have been successfully shown to enable this 
transformation in a catalytic fashion*®. Using either a nucleophilic or 
electrophilic source of CF; (Ruppert-Prakash or Umemoto’s reagent, 
respectively), it is now possible to install an aryl trifluoromethyl moiety 
in place of halides and boronic acids, or adjacent to tailored directing 
groups (Fig. 1b)”. These cross-coupling reactions have facilitated the 
synthesis of a range of CF; analogues without the reliance on pre- 
fluorinated building blocks. However, the ideal position of CF; sub- 
stitution on a drug candidate must still be determined through parallel, 
multi-step syntheses employing aryl precursors bearing activating 
groups at various positions around an aromatic ring. Therefore, a 
remaining challenge in the synthesis of organofluorine drugs is this 
reliance on substituted precursors and their varying amenability to 
cross-coupling reactions. 

To address this challenge, we reasoned that the direct installation of a 
CF; group to a drug candidate—ideally at the position(s) of metabolic 
susceptibility—would obviate the need for pre-functionalization. Thus, 
this direct trifluoromethylation of aryl and heteroaryl compounds would, 
in a single chemical operation, preclude the need for redundant synthetic 
efforts and simultaneously protect against metabolic oxidation in vivo. In 
the first part of our design plan, we proposed that a single-electron 
pathway might provide such an approach to address the unsolved 


challenge of trifluoromethylation of non-activated arenes (Fig. Ic). 
Specifically, we sought to take advantage of photoredox catalysis, which 
provides a mild and efficient method for accessing electrophilic radicals, 
such as “CF;, by means of photosynthesis-inspired redox chemistry'*. 
This mode of reactivity employs polypyridyl organometallic complexes, 
such as Ru(phen);3Cl, (1 in Fig. 2, phen = phenanthroline), whose excita- 
tion by visible light (for example, a household light bulb) at room tem- 
perature provides a strongly oxidizing or reducing catalyst that can 
rapidly engage a variety of substrates to generate high energy, reactive 
species’®. 

Asa second part of our design plan, we recognized that the moderate 
to low nucleophilicity of many classes of unfunctionalized arenes has 
been typically underemployed in substitution reactions, presumably 
owing to the harsh conditions required to manipulate such aromatic 
rings via two-electron pathways (for example, nucleophilic or electro- 
philic aromatic substitution). As an alternative, we considered that a 
strategy based on radical addition could exploit this large area of 
aromatic structural space, thereby rendering a more general approach 
to CF; incorporation. Our strategy, outlined in Fig. 1, involves 
site-specific incorporation of electrophilic radicals at metabolically 
susceptible positions of arenes and heteroarenes—in analogy to the 
single-electron aryl modification processes employed by enzymes. This 
approach precludes the need for pre-functionalization of arenes, and 
offers a complementary method of accessing ary] trifluoromethyl func- 
tionality from late-stage synthetic intermediates. 
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Figure 1 | Direct trifluoromethylation of aryl and heteroaryl C-H bonds. 
The excretion of medicinal agents is facilitated by remote functionalization of 
aromatic moieties (a). A common medicinal chemistry approach to block this 
catabolism involves cross-coupling of CF; reagents to pre-functionalized 
arenes (b). In analogy to enzymatic processes, our photoredox strategy 

(c) enables the direct C-H functionalization of unfunctionalized arenes and 
heteroarenes with the CF; pharmacophore using a cheap and easy to handle 
“CF; source. 
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Building on our recent investigations of the «-trifluoromethylation 
of carbonyls with CF3I as a “CF; source’’, our initial exploration of a 
photoredox strategy towards the development of this transformation 
was successful within the limited context of m-rich arenes. We quickly 
observed, however, that reaction efficiency was diminished by com- 
petitive aryl iodination, presumably as a result of CF;-I homolysis. 
Our mechanistic understanding of this undesired pathway—wherein 
initiation of the photocatalytic cycle relies on sacrificial oxidation of an 
electron-rich arene—prompted us to seek an alternative method of 
catalytic initiation. Because direct reduction of CF3I (at a potential 
of —1.52V versus the saturated calomel electrode, SCE, in cyclic 
voltammetry)"* by visible-light-excited photocatalyst *Ru(phen)3~~ 
2 (Fig. 2; —0.90 V versus SCE)’? is thermodynamically challenging, 
we questioned whether the incorporation ofan electron-deficient system 
such as SO, within CF;X might enable more facile reduction’. Our 
investigations led us to the use of trifluoromethanesulphonyl chloride 
(known as triflyl chloride, TfCl), which has been employed in a non- 
general sense to effect aryl trifluoromethylation of benzene and other 
simple arenes via atom transfer catalysis*’**. In particular, the relative 
cost and ease of handling of TfCl in comparison to other “CF; sources 
rendered this a highly attractive reagent. 

For the proposed photoredox catalytic cycle, we assumed that 
single-electron transfer (SET) reduction of triflyl chloride should be 
concurrent with oxidation of *Ru(phen);"* 2 to Ru(phen);°* 3 
(Fig. 2). The ensuing CF;SO,Cl radical anion should then rapidly 
collapse to generate the stabilized “CF, a process that should be 
entropically driven by the release of SO, and chloride”. This electron 
deficient trifluoromethyl radical is then well-suited to add—in a selec- 
tive fashion—to the most electron-rich position of any arene or 
heteroarene””*. In the case of benzene, the resultant cyclohexadienyl 
radical 4 (Fig. 2; —0.1 V versus SCE)” should then undergo a second 
SET event with the now strongly oxidizing Ru’ * photocatalyst 3 (Fig. 2; 
+1.31V versus SCE) to regenerate ground-state photocatalyst 1. 


if 
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Pi CFs CF, 
R-L H att UH -H 
Z4 fs 


Figure 2 | Proposed mechanism for the direct trifluoromethylation of aryl 
C-H bonds via photoredox catalysis. The photoredox catalytic cycle is 
initiated via excitation of photocatalyst 1 to excited state 2 with a household 
light bulb. Subsequent reduction of triflyl chloride by reductant 2 (via single 
electron transfer, that is, SET) provides oxidant 3 along with the radical anion of 
triflyl chloride. This high energy species spontaneously collapses to form CF; 
radical (top left), which selectively combines with aromatic systems enabling 
direct CF; substitution. Catalyst 3-promoted oxidation of the ensuing radical 
4 completes the catalytic cycle and provides cyclohexadienyl cation 5, whose 
facile deprotonation provides the desired CF; arene. This mechanism is 
consistent with measured redox values (shown below respective compounds, in 
maroon). 
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Finally, facile deprotonation of the cyclohexadienyl cation 5 (Fig. 2) with 
a suitable base would provide the desired trifluoromethyl arene in a 
redox, catalytic fashion without the need for arene pre-functionalization. 

As illustrated in Fig. 3, we found that this new photoredox protocol 
allows the direct incorporation of CF; into a broad range of arene and 
heteroarene rings using only TfCl and a household light bulb. In con- 
trast to atom transfer pathways for trifluoromethylation that require 
high temperature (120°C), our photoredox strategy is successful at 
room temperature, allowing efficient production of the trifluoromethyl 
radical. Our initial studies revealed that exposure of five-atom hetero- 
cycles to a household 26-W fluorescent light source in the presence of 
triflyl chloride, Ru(phen)3;Cl, (1) photocatalyst, and base, enabled 
trifluoromethylation with excellent levels of efficiency (Fig. 3a). 
More specifically, the fluoroalkylation of pyrroles, furans and 
thiophenes occurred with excellent selectivity and yield (compounds 
6-12, 78-94% yield). Our rationale for the observed selectivity at the 
C2 position of these heteroaromatics relies on the formation of the 
conjugated (rather than cross-conjugated) radical and cationic inter- 
mediates. We also found that both mono- and bis-trifluoromethyla- 
tion could be obtained by varying the amount of CF3 source used (8 
and 9, 88 and 91% yield, respectively). Additionally, a wide range of 
ring substitution and heteroatom protecting groups are well tolerated 
(10-17, 70-87% yield). Electron-deficient five-atom heterocycles such 
as thiazoles are also amenable to this trifluoromethylation protocol, 
and useful levels of efficiency are obtained by increasing equivalents of 
the CF; source (14, 70% yield). 

We next turned our attention to six-atom heterocycles with the 
knowledge that there is an unmet need for CF; cross-coupling tech- 
nologies for these valuable pharmacophores. As shown in Fig. 3b, we 
found that an array of pyrazine, pyrimidine, pyridine and pyrone sub- 
strates are compatible with this new trifluoromethylation approach 
(18-32, 70-94% yield). We further observed that radical addition and 
the ensuing oxidation of electronically deficient halide-substituted 
heteroaromatics (such as 2,6-dichloropyrazine) were also operative. 
Unsubstituted analogues of each of these heterocycles can also be 
trifluoromethylated, albeit with somewhat diminished efficiency. 
Notably, the commercial photocatalyst Ir(Fppy)3; (Fppy = 2- 
(2,4-difluorophenyl)pyridine, see Supplementary Information for com- 
mercial sources, synthesis and photophysical properties) is capable of 
providing higher levels of reactivity (presumably due to a longer-lived 
excited state), although lower levels of regiocontrol are observed with 
this system (the role of catalyst reduction potentials in these outcomes 
is being explored). Finally, the trifluoromethylation of arenes (shown 
in Fig. 3c) demonstrates the complementary reactivity offered by this 
radical process. Ortho-trifluoromethyl containing anilines, anisoles, 
thioanisoles and xylenes—typically challenging motifs to access via 
cross-coupling strategies—are also readily prepared using this 
methodology (34-40, 70-84% yield). Tolerance of halides, multiple 
substitution and large steric bulk are additional benefits of this radical- 
based process (38-47, 72-92% yield). 

Our mechanistic hypothesis for the photoredox process described 
above is supported by emission quenching experiments as well as cyclic 
voltammetry. In the first case, no quenching interactions are observed 
between excited state 2 (Fig. 2) and a range of electronically diverse 
arenes and heteroarenes that are competent partners in this transforma- 
tion; whereas triflyl chloride exhibits significant fluorescence quenching 
(Stern—Volmer constant, 22 M?}; Supplementary Fig. 1). Furthermore, 
we have determined the oxidation potential of triflyl chloride (-0.18 V 
versus SCE; Supplementary Fig. 2) to be in accord with the proposed 
reduction step using excited state 2, as shown in Fig. 2. It is also 
noteworthy that decreased reactivity is observed in the absence of base, 
presumably due to HCl formation in the reaction. Inorganic bases are 
preferred to organic bases, as pyridines are trifluoromethylated and 
alkyl amines are prone to oxidative decomposition. 

Ultimately, a major benefit of our mild, visible-light-induced 
trifluoromethylation procedure is its amenability to late-stage synthetic 
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Figure 3 | Radical trifluoromethylation of five- and six-membered heteroarenes and C-H arenes via photoredox catalysis. Aromatic and heteroaromatic 
systems with varying stereoelectronics are efficiently trifluoromethylated under these standard conditions (top, generalized reaction). Substrate scope includes 
electron-rich five-atom heteroarenes (a), electron-deficient six-atom heteroarenes (b), and unactivated arenes (c). Isolated yields are indicated below each entry (PE 
NMR yields for volatile compounds). See Supplementary Information for experimental details. Abbreviations: X, Y, A, B, C represent either C, N, O, or $; MeCN, 
acetonitrile; Me, methyl; Boc, tert-butoxycarbonyl; Ac, acyl; tBu, tert-butyl. *The minor regioisomeric position is labelled with the respective carbon atom number. 


applications. To demonstrate this potential, biologically active molecules 
(Fig. 4) were subjected to our photoredox trifluoromethylation pro- 
tocol to obtain their CF; analogues. In some cases, highly regioselective 
addition is observed, presumably corresponding to positions of high 
electron density and potential metabolic susceptibility (Fig. 4a). In 
medicinal chemistry, the simultaneous identification and blocking of 
such positions with a CF; group may prove quite useful. For instance, a 
DNA base (uracil) analogue, an Alzheimer’s drug (Aricept) precursor, 
and a vitamin (flavone) were each selectively trifluoromethylated at a 
single position with excellent efficiency (85-94% yield). The first 
example represents a facile, one-step synthesis of a known CF;3-uracil 
derivative from its abundant non-fluoro analogue. These uridines have 
been extensively studied owing to their anti-viral and cancer-treating 
properties”. 

Alternatively, a successful drug discovery programme is likely to 
eliminate potential sites of metabolic reactivity from a medicinal agent 
during the course of its development”. Without an especially electron- 
rich m-system present, competitive trifluoromethylation of the 
remaining aromatic sites should be observed. Not surprisingly, radical 
trifluoromethylation of lidocaine and ibuprofen results in alkylation at 
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both of their available aromatic positions (Fig. 4b). Subjection of the 
cholesterol-lowering statin, Lipitor, to our protocol similarly results in 
promiscuous functionalization at three positions (Fig. 4c; 74% yield, 
1:1:1). In the context of drug discovery, the isolation of these separate 
isomers offers the potential for rapid testing of fluorinated analogues 
for biological activity from a single late-stage synthetic product. This 
streamlined process represents an attractive alternative to multi-step, 
parallel syntheses. Perhaps most importantly, this technology provides 
a strategically new approach to the introduction of metabolism- 
blocking fluoroalkyl groups at the end of a medicinal chemistry 
development effort. 

In this work, we have introduced a photoredox-based method that 
allows for facile trifluoromethylation of aromatic and heteroaromatic 
systems without the need for aryl ring pre-activation. The value of this 
transformation has been highlighted via the trifluoromethylation of 
biologically active molecules with commercially available photocatalysts 
and a household light bulb. While this manuscript was under review, a 
complementary method for heterocyclic trifluoromethylation was 
reported by others*’. Their approach also employs radical inter- 
mediates to harness innate chemical reactivity of unactivated 
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Figure 4 | Direct trifluoromethylation of biologically active molecules. 
Subjecting common medicinal agents and other biologically active molecules to 
our standard photoredox protocol enables direct CF; installation selectively at 
metabolically susceptible positions of some molecules (a). Alternatively, C-H 
functionalization of more metabolically stable medicines occurs non- 
selectively, allowing for rapid access to drug analogues (b). The promiscuous 
modification of equally reactive 2-systems, such as the arenes in Lipitor, may be 
followed by separation of isomers (for example, via supercritical fluid 
chromatography, SFC) for rapid screening of biological activity (c). 


substrates, although their use of peroxides as a radical initiator affords 
divergent substrate tolerance and reaction efficiency. We expect these 
two simple protocols to be of broad utility for the synthesis and 
development of new medicinal agents. 


METHODS SUMMARY 


General procedure. An oven-dried vial was equipped with photocatalyst 
(1-2 mol%), dry K,HPO, (3 equiv.), arene or heteroarene (0.5 mmol, 1 equiv.) 
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and MeCN (0.125 M), and degassed by alternating vacuum evacuation and argon 
backfill at —78 °C. The triflyl chloride (1-4 equiv.) was then added and the solu- 
tion was stirred at room temperature adjacent to a 26-W compact fluorescent light 
bulb. After 24h, the reaction mixture was purified by column chromatography to 
furnish the desired CF; product. Full experimental details and characterization 
data (‘H, °C, °F NMR, infrared spectroscopy, high-resolution mass spectro- 
metry) for all new compounds are included in Supplementary Information. 
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Combined obliquity and precession pacing of late 


Pleistocene deglaciations 


Peter Huybers' 


Milankovitch’ proposed that Earth resides in an interglacial state 
when its spin axis both tilts to a high obliquity and precesses to 
align the Northern Hemisphere summer with Earth’s nearest 
approach to the Sun. This general concept has been elaborated into 
hypotheses that precession’, obliquity** or combinations of both** 
could pace deglaciations during the late Pleistocene”. Earlier tests 
have shown that obliquity paces the late Pleistocene glacial 
cycles*” but have been inconclusive with regard to precession, 
whose shorter period of about 20,000 years makes phasing more 
sensitive to timing errors*’’’*. No quantitative test has provided 
firm evidence for a dual effect. Here I show that both obliquity and 
precession pace late Pleistocene glacial cycles. Deficiencies in time 
control that have long stymied efforts to establish orbital effects on 
deglaciation are overcome using a new statistical test that focuses 
on maxima in orbital forcing. The results are fully consistent with 
Milankovitch’s proposal but also admit the possibility that long 
Southern Hemisphere summers contribute to deglaciation. 

During the late Pleistocene—roughly over the past million years— 
Northern Hemisphere continental ice has alternately covered much of 
northern North America and Fennoscandia and then retreated to 
today’s relatively ice-free conditions at intervals of approximately 
100,000 years (100 kyr). The cause of these massive shifts in climate 
remains unclear not for lack of models, of which there are now over 
thirty’"'°'*4, but for want of means to choose among them. Previous 
statistical tests have demonstrated that obliquity paces the ~100-kyr 
glacial cycles*”’, helping narrow the list of viable mechanisms, but have 
been inconclusive with respect to precession (that is, P > 0.05) because 
of small sample sizes and uncertain timing*'*””. 

Whether precession influences the ~100-kyr glacial—interglacial 
cycles is not obvious. Precession alters diurnal average insolation 
intensity by as much as 30 Wm ~* ona given day of the year, suggest- 
ing a powerful forcing, and its signature clearly appears in proxy 
records of temperature and ice volume at ~20-kyr periods’. 
However, its insolation anomalies are exactly counterbalanced across 
the seasons so that annual insolation at any latitude is independent of 
precession’. Furthermore, proxies of early-Pleistocene glaciation 
show strong obliquity and little precession variability, indicating that 
precession had negligible influence during this next-most-recent 
epoch of glaciation’®, though see ref. 17 for another view. 

Here, I test whether anomalously large combinations of precession 
and obliquity forcing combine to determine when deglaciations 
occurred during the late Pleistocene. The test involves three steps. The 
first is to estimate the timing of terminations, for which I use a composite 
6'°O record whose chronology is derived by linearly interpolating age 
with depth between the last deglaciation and radiometrically dated geo- 
magnetic reversals''. Timescale uncertainty over the past million years is 
estimated by running a stochastic sediment accumulation rate model" 
that also accounts for uncertainties in the alignment of features between 
6'80 stratigraphies, decompaction of sediment, transport times of 6'°O 
within the ocean, and geomagnetic reversal ages. The age of the 
Matuyama-Brunhes geomagnetic reversal was earlier assumed to be 
known to within +2kyr (one standard deviation, 1s.d.)"', but to 


account for uncertainty in the “°K decay constant'*”’, it is now 
represented as occurring at 780 + 8kyr (1s.d.). Terminations are 
identified by local maxima in the time rate-of-change of the 5'*O 
record that exceed a value of 0.095%o per kyr, giving the usual ter- 
mination features” except that termination 3 contains two parts that 
are labelled 3a and 3b (Fig. 1a). (Thresholds ranging between 0.07% 
and 0.17%o per kyr would give different numbers of terminations but 
give similarly significant results.) The average uncertainty in the age of 
the 12 identified termination features is 8 kyr (1 s.d.), with older ages 
generally being more uncertain. 

The second step is to define an insolation forcing function, of which 
there are many varieties'*”*. For present purposes only the relative 
shape of the forcing function is needed, and a generic and broadly 
representative formulation’ can be adopted: 


Fy =o? sin(a,—) + (1—a) 7a (1) 


Here e represents eccentricity, w is the angle from vernal equinox to 
perihelion, ¢ is obliquity, subscript ‘t’ indicates time, and ¢ and « are 
adjustable parameters that respectively control the phase of precession 
and the relative contributions from precession and obliquity. Both 
e,sin(c, — p) and é, are normalized to zero-mean and unit variance such 
that F; also has unit variance. Milankovitch', along with many sub- 
sequent authors” ””'®!°, called upon anomalies in incoming solar radi- 
ation during the Northern Hemisphere summer to determine whether 
the Northern Hemisphere is glaciated. Increased insolation intensity 
during Northern Hemisphere summer results from greater obliquity 
anda phase of precession that brings Earth closer to the Sun during that 
season, and can be represented by setting ¢ = 0° and «=0.5. The 
resulting structure of F; (Fig. 1b) shares more than 99% of its variance 
in common with both Milankovitch’s caloric summer half-year insola- 
tion’ at 65° N and summer energy’® at 65° N when using a threshold of 
350 W m ~, providing a suitable representation of the hypothesis that 
Northern Hemisphere summer insolation controls glaciation. 

In the third and final step, forcing maxima in F; that correspond most 
closely in time with each termination are compared against forcing 
maxima not associated with terminations (Fig. 1b). In particular, the 
median value of the non-termination maxima is subtracted from the 
median value of the termination maxima, yielding 6m = 0.89. Unlike the 
Rayleigh’s R statistic (relied upon for previous tests of orbital influence 
upon deglaciation*’’'*”"), timing errors do not affect 6m unless they 
cause the wrong forcing cycles to be identified. Median values are also 
less sensitive to timing errors because outliers generally have no effect. 

The significance of 6m is assessed within the context of a null 
hypothesis Ho, that termination timing is independent of F;, and an 
alternative hypothesis H,, that terminations tend to occur when the 
maxima in F; are anomalously large. For purposes of comparison with 
earlier work’, a modified random walk representing ice-volume vari- 


ability is adopted for the null hypothesis: 
D=Ur-1 +, and if vu, >h, then terminate (2) 


Ice volume v;, accumulates by a random increment 7; during each 1-kyr 
time step, until a threshold h, is passed, and the termination of all ice is 


Harvard University, Department of Earth and Planetary Sciences, 20 Oxford Street, Cambridge, Massachusetts 02138, USA. 


8 DECEMBER 2011 | VOL 480 | NATURE | 229 


©2011 Macmillan Publishers Limited. All rights reserved 


LETTER 


8'8O anomaly 


b 

Sir iA ee eee aS ete es Ne 5 lt eS le 
= 

rs) 1 = = A _ oe = _f a 
: | 

ne) 0 

N 

FE: -1 

5-2} 

a 


-3 
-1,000 -900 -800 -700 -600 -500 -400 -300 -200 -100 0 


Time (kyr) 


Figure 1 | Test of the Milankovitch hypothesis. a, Anomalies in 6'°O ona 
depth-derived timescale'’. Terminations are identified when rates of change 
exceed 0.95%o per kyr and are pinpointed at their maximum rate of change (red 
dots, with 2 s.d. time-uncertainty indicated by the horizontal bars). b, Orbital 
forcing containing equal amounts of obliquity and climatic precession 
variability and the contribution of climatic precession peaking during the 
Northern Hemisphere summer solstice. The forcing maximum nearest each 


prescribed to occur over the ensuing 10 kyr. Defining h, = 90 units, 
randomly initializing ice volume between 0 and 90units, and 
drawing 7, from a normal distribution with unit mean and two units 
of standard deviation gives glacial cycles with a period of 100 + 20 kyr, 
consistent with observations*. Terminations are identified from 
realizations of v; in the same manner as for the 5'80 record, except 
that v, is first smoothed using an eleven-point running average, and 
rates of ice volume change must be less than —5 units per kyr. Given 
these random realizations of termination times, a value of 6m is com- 
puted from F;, and the process repeated 10° times to build up a 
distribution for Hp (Fig. 1c). This null-distribution of 6m is shifted 
towards positive values because of a tendency for large-amplitude 
forcing cycles to have a longer period, making Ho harder to reject than 
if the nonparametric Wilcoxon rank sum test” had instead been used. 

Hy, is similarly derived from equation (2), but after modifying 
the threshold condition to be sensitive to insolation forcing, 
h, =110—25F,, and prescribing 7, to have unit variance, which again 
yields glacial cycles of 100 + 20 kyr. Under H;, terminations tend to 
trigger when both ice volume and F; are large. Termination times 
calculated according to H, are then perturbed using realizations from 
the stochastic sediment accumulation model consistent with the errors 
estimated for the depth-derived ages (see Fig. 1a). 

The statistical power of the test is measured as the probability of 
rejecting Hy at the P = 0.05 significance level if H, is correct. In the 
absence of timing errors, the statistical power would be 0.95, but even 
when timing errors are accounted for, it is still 0.67. Although the power 
of the test depends upon the form and parameters chosen for the null 
and alternative models, these results indicate that, unlike for previous 
tests*'''?, the combined influence of obliquity and precession will be 
identifiable, if present. Indeed, the observed value of dm = 0.89 is highly 
significant as judged against Hp (P = 0.002) and is near the maximum 
likelihood value of H, (Fig. 1c), even after accounting for errors in timing. 
Therefore, it is possible to reject the null hypothesis of no orbital influ- 
ence with a high degree of confidence, and the results are consistent with 
precession and obliquity together exerting control upon deglaciation. 

To explore the sensitivity of these results to different formulations of 
the orbital forcing, I repeated the test for F, computed using all phases 
of precession and all mixtures of obliquity and precession (Fig. 2). Such 
combinations reproduce the variability expressed by other measures of 
insolation forcing, including the caloric summer half-year', summer 
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termination is indicated (red dots), and the median across all such maxima (red 
dashed line) is 0.89 normalized units greater than the median of the maxima 
not associated with terminations (black dots and dashed line). c, The difference 
in the medians (vertical grey bar) is highly significant (P = 0.002) as judged 
against the null hypothesis (Hp, black with the 5% rejection region filled). The 
difference in medians is also consistent with the alternative hypothesis (H,, red 
lines), whether timing errors are ignored (dashed) or accounted for (solid). 


energy'® (as long as thresholds are not set near peak intensity), and 
insolation at any particular latitude and on any particular day of the 
year~*. The resulting contour of significance levels shows highly sig- 
nificant results (P < 0.01) in the vicinity of perihelion aligning with the 
Northern Hemisphere summer solstice and equal contributions from 
obliquity and precession, though significant results (P < 0.05) are also 
found for perihelion occurring a month earlier or several months later 
and forcing mixtures having « between 0.3 and 0.9. This shows that 
any formulation of F, in general agreement with Milankovitch’s 
hypothesis would also give a positive result. 

To specifically inquire into the ability to reject the null hypothesis of 
obliquity-only pacing of the glacial cycles, I adopt a null hypothesis 
wherein the threshold in equation (2) varies according to 
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Figure 2 | Sensitivity of the test to choice of orbital forcing. Most orbital 
forcing curves can be represented as a linear mixture of obliquity and climatic 
precession (x axis, giving the fraction of precession variance), where climatic 
precession can take on any phase (y axis, where 0° indicates greatest forcing 
during the Northern Hemisphere summer solstice). The test’s P-value is 
contoured after applying it to each forcing function. Values less than 0.05 are 
significant, and the results are consistent with Milankovitch’s hypothesis. 
Forcing functions with a negative obliquity contribution (not shown) are never 
significant. A two-dimensional smoother was applied to the logarithm of the 
P-values to make the plot easier to interpret while preserving small values. 
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Figure 3 | Orbital forcing of glacial cycles. a, A deterministic model of the 
glacial cycles (red line) using the same threshold condition (grey line) for the 
alternate hypothesis. For reference, a scaled version of the composite 5°0 
record is also shown (black line). With the exceptions of terminations 3a, 8 and 
9 the timing and amplitude of the glacial cycles are generally reproduced. b, The 


h, = 105 — 15&, a formulation consistent with ref. 4 wherein obliquity 
paces the timing of deglaciation. This formulation shifts the null dis- 
tribution towards higher values but nonetheless shows 6m = 0.89 to be 
significant (P = 0.02), a result that also holds when any other com- 
bination of mean and amplitude is specified for the threshold. 
Likewise, adoption of a precession-only null hypothesis invariably 
leads to 6m = 0.89 being significant (P< 0.01), regardless of specifica- 
tions for the mean and amplitude of the threshold and the phase of 
precession. (The obliquity-only null is harder to reject than the pre- 
cession-only null because it leads to selection of forcing maxima in F; 
that tend to have the positive phases of both obliquity and precession 
aligned.) These results show that glacial hypotheses calling singly on 
precession’ or obliquity**, or not accounting for either'*"*, should be 
rejected in favour of combined obliquity and precession pacing of late 
Pleistocene glacial cycles***"°. 

I also repeated the test after updating the depth-derived age model 
using a compilation of available radiometric constraints upon the age 
of the last four terminations™. The most significant revision is to shift 
the ages of terminations 3a and 3b to align with older precession 
maxima, which increases the mean of the deglacial forcing maxima 
but has no effect on the median. Therefore, the revised ages do not 
change the test results, reflecting the test’s overall robustness to timing 
errors. The data and code used to generate each of the above results are 
available in the Supplementary Information. 

The above statistical results can be succinctly illustrated by replacing 
the stochastic variable in the alternative hypothesis with its mean value 
and initializing ice volume to be 25 units at —1,000 kyr. The resulting 
deterministic time-history of glaciation gives a cross-correlation of 
0.73 with the negative of the 5'°O record, which is very high relative 
to other simple model fits given the small number of adjustable para- 
meters* (Fig. 3a). The model result reproduces the basic sawtooth 
structure of the late-Pleistocene glacial cycles over the past million 
years in terms of timing and amplitude, except for terminations 8 
and 3b, and illustrates how obliquity, precession and the growth of 
ice volume can combine to control late-Pleistocene glacial cycles. 
Another implication is that the climate will tend to be driven out 
of glaciation sooner when eccentricity—and hence the amplitude of 
precession—is larger. Because interglacial values of 6'°O are less 


insolation anomaly (in W m_ ”) that drives deglaciations. Anomalies are 
computed as the average insolation across the forcing maxima associated with 
terminations (see Fig. 1) relative to the average insolation over the past million 
years. Note that anomalies are shown relative to day of the year, not solar 
longitude”®. 


variable than glacial values, this relationship implies that the amplitude 
of the ~100-kyr glacial cycles are smaller when eccentricity is larger, 
consistent with the findings reported in ref. 21. 

Combined orbital pacing is also consistent with earlier findings that 
the intervals between successive deglaciations cluster into 80-kyr or 
120-kyr periods*"', indicative of two or three obliquity cycles. 
Precession, with its ~20-kyr period, achieves a maximum during nearly 
every interval of above-average obliquity, given the ~40-kyr period of 
obliquity, but the reverse does not hold. Therefore, precession will tend 
to influence the precise timing of a deglaciation within an obliquity 
cycle, but obliquity will more fundamentally govern the interval 
between deglaciations. Whether precession also influences the precise 
timing of terminations during the early Pleistocene, when deglaciations 
occur more nearly every 40 kyr, remains an open question'®’”. Perhaps 
terminations then occurred nearly every time precession maxima 
coincided with above-average values of obliquity; such a scenario can 
be obtained by setting the threshold in equation (2) to lower values. 

Ice sheets tend to collapse in response to unusually large maxima in 
insolation forcing that result from the coincidence of high obliquity 
and alignment of perihelion with Northern Hemisphere summer 
solstice, consistent with the models hypothesized by Milankovitch' 
and others*’°. During these forcing maxima, summer insolation is as 
much as 40 W m ~ greater at high northern latitudes (Fig. 3b). However, 
this consistency is not exclusive of all other orbital contributions to 
deglaciation. For instance, when perihelion aligns with the Northern 
Hemisphere summer solstice, aphelion occurs during the Southern 
Hemisphere summer, causing the length of the Southern Hemisphere 
summer to be longer (Fig. 3b) and, possibly, increasing the escape of CO, 
from the Southern Ocean into the atmosphere**”’. The climate system is 
thoroughly interconnected across temporal and spatial scales, and, just 
as neither obliquity nor precession act in isolation, no one region should 
be expected to exert exclusive influence upon deglaciation. 
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Frictional ageing from interfacial bonding and the 
origins of rate and state friction 


Qunyang Li't, Terry E. Tullis’, David Goldsby” & Robert W. Carpick' 


Earthquakes have long been recognized as being the result of stick- 
slip frictional instabilities’*. Over the past few decades, laboratory 
studies of rock friction have elucidated many aspects of tectonic 
fault zone processes and earthquake phenomena®”. Typically, the 
static friction of rocks grows logarithmically with time when they 
are held in stationary contact®, but the mechanism responsible for 
this strengthening is not understood. This time-dependent 
increase of frictional strength, or frictional ageing, is one mani- 
festation of the ‘evolution effect’ in rate and state friction theory’. 
A prevailing view is that the time dependence of rock friction 
results from increases in contact area caused by creep of contacting 
asperities”*. Here we present the results of atomic force microscopy 
experiments that instead show that frictional ageing arises from 
the formation of interfacial chemical bonds, and the large mag- 
nitude of ageing at the nanometre scale is quantitatively consistent 
with what is required to explain observations in macroscopic rock 
friction experiments. The relative magnitude of the evolution 
effect compared with that of the “direct effect—the dependence 
of friction on instantaneous changes in slip velocity—determine 
whether unstable slip, leading to earthquakes, is possible”. 
Understanding the mechanism underlying the evolution effect 
would enable us to formulate physically based frictional constitu- 
tive laws, rather than the current empirically based ‘laws’'’”, 
allowing more confident extrapolation to natural faults. 

Frictional ageing has been attributed to increases in contact area 
caused by asperity creep (contact ‘quantity’) as well as to time-dependent 
strengthening of bonding at asperity contacts (contact ‘quality’). 
Evidence exists to support both interpretations. By measuring the 
amount of light transmitted across rough Lucite plastic surfaces in con- 
tact, and rough soda-lime glass surfaces in contact, Dieterich and Kilgore® 
observed gradual increases in the sizes of illuminated microscopic con- 
tacts over time. Experiments on a silicate rock (quartzite) have demon- 
strated that frictional ageing is suppressed by drying the samples and 
conducting friction experiments in a water-free environment’. Because 
creep is inhibited in experiments on silicate minerals like quartz at high 
temperature in the absence of water (that is, in the absence of hydrolytic 
weakening"’), these observations are consistent with the hypothesis that 
static friction increases with changes in contact area caused by asperity 
creep’. However, frictional ageing may also result from strengthening of 
chemical bonding at the interface over time. Chemical bonding could be 
enhanced by time-dependent desorption of contamination films’ that 
may include water, or through chemically assisted mechanisms (such 
as hydrogen bonding or siloxane bridging)'*"* that can be aided by 
water and also affected by the contact stresses. 

Devising experiments to understand the contribution of each mech- 
anism to ageing is exceedingly difficult, because the buried frictional 
interface is not readily accessible and involves myriad microscopic 
asperity contacts with a range of sizes down to the nanoscale’’”®. 
Here, we investigate the ageing behaviour of a single nanoscale contact 
to isolate the physical origins of the ageing process and provide new 
perspectives on the empirical rate- and state-dependent friction laws. 


We conducted single-asperity slide-hold-slide (SA-SHS) friction 
experiments between nanoscopically sharp oxidized silicon tips and 
oxidized silicon wafers using an atomic force microscope (AFM); these 
tests are analogous to similar slide-hold-slide tests on rocks**. The use 
of amorphous SiO, (silica) to investigate the behaviour of rocks com- 
posed of silicate minerals is reasonable, because silica glass and rocks 
behave similarly in friction experiments on rough surfaces”’. 

Figure 1 shows an example of SA-SHS measurements performed at 
60% relative humidity (RH). Similar to slide-hold-slide experiments 
on rocks*®, the static friction force is larger than the steady-state sliding 
friction force by an amount AF, called the friction drop, that grows 
linearly with the logarithm of the hold time (inset to Fig. 1). Similar 
trends are obtained at other humidities. The linear dependence of AF 
on the logarithm of hold time is consistent with observations from 
friction experiments on macroscopic rocks and other materials®’*, but 
the magnitude observed here is far larger. The ratio between AF and 
the steady-state friction F,, (AF/F,,), which we call the ‘relative friction 
drop’, ranges from 0.5 to 5 over the range of humidity studied for 100-s 
hold times, compared to less than 0.05 for a 100-s hold time observed 
for macroscopic quartz rock samples'®. This unusually large ageing 
effect is reproducible for all the SA-SHS experiments, with more than 
five distinct silica—silica pairs tested. 

The much larger ageing of the relative friction drop at the nanoscale 
than at the macroscale suggests that frictional ageing may be a length- 
scale-dependent phenomenon influenced by the multi-asperity 
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Figure 1 | Lateral force versus nominal lateral displacement data for typical 
SA-SHS tests after stationary holds at 60% RH. Upon lateral displacement, 
the tip sticks to the substrate, resulting in linear, elastic lateral loading of the 
AFM cantilever (Supplementary Figs 1 and 2). When the lateral force exceeds 
static friction, the tip slips forward, indicated by abrupt drops in lateral force 
(AF), followed by subsequent sliding at the steady-state friction force (F,s). In 
the inset AF varies linearly with the logarithm of hold time. The dotted line is a 
linear fit of the averaged values. 
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character of rough surfaces. To test this hypothesis, we construct a 
simplified mechanics model to predict friction as a function of nominal 
contact size for a fixed number of discrete asperities per unit area 
(Fig. 2). Comparing a small (19 asperities) and large (1,459 asperities) 
nominal contact area, with the same individual asperity (i) relative 
friction drop (AF‘/F!, = 1), the total macroscopic relative friction drop 
(AF! /Fiotal) values are very different: 0.81 for the smaller nominal 
contact and 0.15 for the larger nominal contact. The behaviour can be 
understood as follows. Owing to elastic interactions between asperities 
through the substrate, the asperities do not experience the same shear 
stress and thus do not slide simultaneously; the shear stress rises from 
the contact centre outward, until slip occurs. For sufficiently small 
nominal contact areas, asperities slide nearly at the same time, so 
macroscopic ageing is similar to individual asperity ageing. However, 
for larger nominal contacts, only a small fraction of asperities experi- 
ence the peak forces at any given time: those in the outermost annulus 
have slid and thus relaxed the peak stress to the much lower steady- 
state sliding value, thus providing little extra resistance, whereas those 
in the centre have not yet been stressed to the peak shear stress, thus 
providing little contribution to the measured friction force. Therefore, 
only a small internal annulus of asperities exhibit the peak shear stress, 
and hence AF’! /F'et@! of a nominally micrometre-sized contact inter- 
face that has nanoscale asperities will be less than 10% of the corres- 
ponding values for the single asperities. 

Although the model is highly idealized, it demonstrates a general 
behaviour: larger nominal contacts will generally have a much lower 
macroscopic ageing effect than that measured for individual asperities 
within the nominal contact area regardless of the specific mechanism(s) 
for ageing. This can account for the different magnitudes of frictional 
ageing seen in rock friction experiments on rough surfaces®"® and in the 
single-asperity experiments. This scale dependence is somewhat ana- 
logous to that observed for adhesion: small objects adhere to surfaces 
far more strongly (relative to their weight) than macroscopic objects, 
owing to the numerous asperities on rough surfaces of large objects”. 


AFIFI,, = 100% 


AFtotal / F_ total — 81% 
ss 


AFtotal / Fist) =15% 


Figure 2 | The quasi-static discrete asperity model. a, An interface between a 
rigid top surface and an elastic half space with a circular nominal contact area 
encompassing identical close-packed circular asperities at fixed spacing (small 
circles), all with maximum static friction force Fy, (with AF'/F!, = 1). Upon 
lateral loading by F°°“", asperities are uniformly elastically strained up to their 
maximum static friction, whereupon they slip (see Supplementary 
Information). The shear force distributions at the peak frictional force for 19 
(b) and 1,459 (c) asperities result in macroscopic relative static friction 
(AF! /Ftet!) that is high (0.81) for a but much lower (0.15) for c. 
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Although the unusually large ageing effect at the single-asperity level 
revealed by our experiments is required to explain the frictional ageing 
observed in friction experiments on rough macroscopic rock surfaces, 
the results presented thus far do not identify the underlying cause of 
this large ageing effect. In the present SA-SHS experiments, we found 
that ageing can depend critically on the chemical state of the AFM tip 
(Fig. 3). We found that frictional ageing was substantially but reversibly 
suppressed when the AFM tip was pulled out of contact and exposed to 
the humid environment before undergoing an SA-SHS test; ageing 
could be revitalized by sliding the tip across the substrate for some 
cumulative run-in sliding distance (and by not breaking contact imme- 
diately before an SA-SHS test). While the tip is out of contact, the tip is 
susceptible to adsorption of gaseous species, including organics (well 
known to reduce the surface energy of silica surfaces)”*, whereas during 
scanning of the tip on the substrate, such adsorption is suppressed and 
adsorbates can be removed. This clearly indicates that plastic deforma- 
tion of the tip is not involved in the frictional ageing observed in the 
SA-SHS tests, given that lifting up and sliding the tip would not influ- 
ence plastic deformation of the tip in such a reversible manner. 

The surfaces of oxidized silicon samples cleaned with Piranha solu- 
tion (5:1 concentrated sulphuric acid to 30% hydrogen peroxide solu- 
tion) are highly hydroxylated**”’. In a humid environment, hydrogen 
bonding can arise between two hydroxylated silica surfaces'”'*”*, 
which, together with van der Waals and capillary interactions, will 
increase the overall interfacial adhesion and friction. Furthermore, at 
raised temperatures the silanol (Si-OH) groups from opposing sides of 
contacting surfaces can react to form stronger siloxane (Si-O-Si) 
bonds'’*°. At room temperature (typically 23 °C), with applied load 
and sufficient time, the reaction may also proceed”’, but the kinetics 
have yet to be studied in detail. Capillary interactions are unlikely to be 
a dominant mechanism because of the persistence of ageing at humidities 
below 1% RH (see Supplementary Fig. 5). Therefore, we hypothesize that 
frictional ageing is caused by hydrogen or siloxane bond formation 
across the interface. 

To test this hypothesis, we performed another set of SA-SHS tests 
by sliding a silica tip over two chemically inert surfaces: hydrogen- 
terminated diamond, and graphite. The diamond sample is grown by 
chemical vapour deposition (typical grain size ~1 1m) and subse- 
quently exposed to atomic hydrogen using an established method’ 
that produces a highly uniform, nearly oxygen-free, monohydride 
surface before the friction experiment. The graphite surface was pre- 
pared by freshly cleaving a bulk highly ordered pyrolytic graphite 
sample. Interfacial bonding is not expected between silica and either 


a b c 
Tip lifted out of e 
35 AF ie contact for 30 peal 
we 215 
Te ss 
c 
2 
a) 
5 
oO 
N 
oj 
E 
So 
Zz 
0 T T Vt T T me at T T 1 
0 30 60 90 0 30 60 90 0 30 60 90 


Lateral displacement, D (nm) 


Figure 3 | Normalized friction-displacement curves from three sequential 
SA-SHS tests. a, Typical, strong ageing is seen immediately after a first run-in 
scan (cumulative displacement of ~1 mm); AF/F,, = 1.5. b, Ageing is 
substantially suppressed after 30 min of exposure of the tip to the humid 
environment, out of contact with the sample, followed immediately by the SA- 
SHS test shown; AF/F,, = 0.5. c, Large ageing returned to the original value after 
sliding for another run-in scan; AF/F,, = 1.5. All tests used a 100-s hold time at 
40% RH, with a negative lateral hold force (see Supplementary Information). 
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H-terminated diamond or graphite. The normalized friction versus 
displacement plots from SA-SHS tests (100-s hold time) are shown 
in Fig. 4 and compared with data from the oxidized silicon surface. In 
contrast to the significant frictional ageing (static friction increases by 
a factor of six compared to the steady-state friction) for the silica-silica 
surface, the silica-diamond interface and the silica-graphite interface 
show little to no ageing. Contact stresses are estimated to be about 
0.2 GPa, 0.4GPa and 1.5GPa for silica-silica, silica-graphite and 
silica~diamond contacts respectively (see Supplementary Fig. 8). If 
plastic creep occurs for the silica-silica contact, we would also expect 
it in the tip for all of the systems, leading to a similarly high degree of 
ageing (perhaps even more for the silica-diamond contact where the 
contact stresses were largest). However, given the indentation hardness 
of amorphous silica of approximately 12 GPa (ref. 27), we would not 
expect plastic creep in the tip for any of these material pairs at the 
normal stresses involved. Given both this expectation and the results 
shown in Figs 1, 3 and 4, our experiments clearly demonstrate that the 
frictional ageing effect in SA-SHS tests does not primarily arise from 
plastic creep, but from changes in bonding arising from surface 
chemical processes. 

In conclusion, we find that frictional strength for AFM tips of silica 
sliding on a silica substrate increases linearly as a function of the 
logarithm of contact time, in the same fashion as the frictional 
strengths of silicate minerals in macroscopic friction experiments. 
However, the magnitude of the ageing effect for the nanoscale single- 
asperity contacts is much larger than for macroscopic multi-asperity 
rock friction experiments. We demonstrate that this difference should 
be expected given the scales and single- versus multi-asperity geometries 
of the two cases regardless of the specific mechanisms for ageing. At the 
nanometre scale and at relatively low contact stresses, frictional ageing 
cannot be explained by increases in contact area due to plastic creep. 
Rather, it is associated with changes in chemical bonding as demon- 
strated by the dependence of ageing on the identity of the samples that 
are in contact with the AFM tips, and the reversible suppression of 
ageing by removing the tip from otherwise continuous contact with 
the sample. This reduction of ageing due to exposure of the interface 
to ambient conditions may contribute partially to the low frictional 
ageing observed in macroscopic rock friction experiments. In such 
experiments, both contacting surfaces are usually rough and the local 
asperities can be exposed to water or other contaminants intermittently 
during sliding. However, this effect alone is insufficient to account 
for the very large difference in ageing we observe; variations in 
relative frictional ageing before and after breaking the contact within 
our experiment is typically between 0.5-5, whereas the difference 
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Figure 4 | Three SA-SHS tests between a silica tip and three different 
surfaces. a, Silica-silica; b, silica-hydrogen-terminated diamond; ¢, silica- 
graphite. The tests were all performed at 60% RH for a 100-s hold time. The 
normal load in each case is maintained at approximately 1 nN. The lateral 
forces during stationary hold are negative (see Supplementary Information). 
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between our results and macroscopic experiments is two orders of 
magnitude. 

The magnitude of ageing from these chemical-bonding effects is 
large enough to account for the evolution effect in rate and state 
friction without a time-dependent increase in contact area. Whether 
the primary cause of frictional ageing in macroscopic friction experi- 
ments is an increase in the quantity and strength of chemical bonding 
or an increase in contact area due to plastic creep is unresolved by our 
data. However, we have demonstrated that chemical bonding is a 
viable explanation, although we have not determined the specific 
chemical mechanism that leads to such a slow, logarithmic ageing. 
The logarithmic behaviour demonstrates that the rate of reaction is 
slowing down as a function of time. Further experimental and theor- 
etical studies are needed to address the mechanism(s) that account for 
this. 


METHODS SUMMARY 


Experiments were conducted by sliding silica AFM tips over silica surfaces using 
friction force microscopy (see Methods and Supplementary Fig. 1). The probes 
and substrate are processed by heating silicon probes and Si(100) wafers in pure 
oxygen to form oxides 100 nm and 400 nm thick respectively. The substrates were 
cleaned with Piranha solution, rinsed with de-ionized water, dried under pure 
nitrogen flow, then rapidly inserted into the AFM. The substrates have a typical 
root-mean-square roughness of ~0.2 nm within a 0.5 um X 0.5 um area, accord- 
ing to the AFM. Normal and lateral force constants of the probes were calibrated 
using Sader’s method”* and a diamagnetic lateral force calibrator” respectively. All 
measurements are carried out in a sealed chamber with RH varied from <1% to 
80%. The system was equilibrated for at least 4h after each humidity change to 
reduce thermal positional drift, which is particularly important when long hold 
times are used. The remaining drift may contribute to the increased scatter seen at 
longer hold times. This increased scatter may also be due to higher sensitivity of 
the interfacial strength to any inhomogeneities at the interface. RH was monitored 
by a hygrometer (uncertainty +1.5% RH). A typical friction experiment is per- 
formed by first making contact between the tip and substrate, then sliding for an 
initial run-in distance. Then, without breaking contact, a slide-hold-slide procedure 
is used in which the tip is slid for a prescribed distance (typically ~400 nm), then 
held stationary for a time fy,o1a, and then lateral displacement is resumed. Constant 
normal load is imposed by a feedback loop. Four different hold times, in an altern- 
ating sequence of 0.1s, 100s, 1s and 10s, are used. The lateral force during the 
stationary hold was maintained at either a positive or negative value (see Methods, 
Supplementary Figs 2, 6 and 7 and associated discussion). 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Friction force microscopy. Friction force microscopy was conducted using an 
RHK UHV350 AEM. A schematic of the setup of a friction force microscope is 
shown in Supplementary Fig. 1. The microscope consists of a sharp tip in contact 
with the surface of the sample. The tip is integrated with the end of a micro- 
fabricated cantilever that deforms in response to normal and lateral (friction) 
forces arising from the contacting interface. Deformation of the cantilever is 
measured with an optical system consisting of a laser and a quadrant photo 
detector. For frictional sliding, the back end of the cantilever is displaced relative 
to the sample along the indicated scan direction. Because of the extremely small tip 
radius, the lateral size of the contacting interface is usually a few to tens of nano- 
metres, which provides an ideal single-asperity contact. During a friction experi- 
ment, the relative lateral displacement between the sample and the holder is 
increased at a constant rate and the resulting lateral force is measured while the 
normal load is maintained constant via an active feedback loop. 

Normal and lateral force constants of the probes were calibrated using Sader’s 
method” and a diamagnetic lateral force calibrator” respectively. All measure- 
ments were carried out in a sealed chamber with RH varied by bubbling pure dry 
nitrogen vapour from a liquid nitrogen dewar through water (or pure nitrogen, for 
measurements quoted at <1% RH). The system was equilibrated for at least 4h 
after each humidity change. RH was monitored by a hygrometer (uncertainty 
*+1.5% RH). 

Sample preparation. The probes and substrate were processed by heating silicon 
probes (CSC37 Mikromasch) and Si(100) wafers in pure oxygen for 150 min and 
300 min, respectively, to form oxides 100 nm and 400 nm thick (determined by 
ellipsometry), respectively. The substrates were cleaned with Piranha solution, 
rinsed with de-ionized water, dried under pure nitrogen flow, then rapidly inserted 
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into the AFM. The substrates have a typical root-mean-square roughness of about 
0.2 nm within 0.5 tm X 0.5 jm area as measured by an AFM. 

The hydrogen-terminated diamond sample was grown by chemical vapour 

deposition (typical grain size ~ 1 um) and subsequently exposed to atomic hydro- 
gen using an established method”’. The graphite surface was prepared by freshly 
cleaving bulk highly ordered pyrolytic graphite. 
SA-SHS experiments. A typical friction experiment is performed by first making 
contact between the tip and substrate, then sliding for an initial run-in distance 
(typically ~1mm, in successive 1-j1m back-and-forth lateral displacements). 
Then, without breaking contact, a slide-hold-slide procedure was used in which 
the tip is slid for a prescribed distance (typically ~400 nm), then held stationary 
for a time fyoig, and then lateral displacement is resumed. Constant normal load is 
imposed by a feedback loop. Four different hold times, in an alternating sequence 
of 0.1s, 100s, 1s and 10s, were used. 

Two protocols are used as shown by the schematics in Supplementary Fig. 2. 
Supplementary Fig. 2a shows the positive lateral hold force protocol. The tip is slid 
for an initial distance (a[1], a[2]), then held stationary for a time fnoig without 
changing the lateral force (a[3]). Sliding is then reinitiated in the same direction by 
ramping up the lateral force (a[4]). Supplementary Fig. 2b shows the negative 
lateral hold force protocol. The tip is slid for an initial distance (b[1]), and then 
held stationary for a time fyo1q Without changing the lateral force (b[2]). Sliding is 
then reinitiated in the opposite direction by ramping up the lateral force (b[3], 
b[4]). The lateral force during the hold is ‘positive’ and usually very close to the 
steady-state friction value for the positive lateral hold force protocol, and is ‘nega- 
tive’ and very close to the steady-state friction level for the negative lateral hold 
force protocol. The results shown in the main text are obtained using the positive 
hold force protocol; the results from the negative hold force protocol are given in 
Supplementary Figs 6 and 7 for comparison. 


©2011 Macmillan Publishers Limited. All rights reserved 
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Acute vision in the giant Cambrian predator 
Anomalocaris and the origin of compound eyes 


John R. Paterson!, Diego C. Garcia-Bellido”, Michael S. Y. Lee**, Glenn A. Brock®, James B. Jago*"® & Gregory D. Edgecombe’ 


Until recently’, intricate details of the optical design of non- 
biomineralized arthropod eyes remained elusive in Cambrian 
Burgess-Shale-type deposits, despite exceptional preservation of 
soft-part anatomy in such Konservat-Lagerstatten”’. The structure 
and development of ommatidia in arthropod compound eyes sup- 
porta single origin some time before the latest common ancestor of 
crown-group arthropods’, but the appearance of compound eyes in 
the arthropod stem group has been poorly constrained in the 
absence of adequate fossils. Here we report 2-3-cm paired eyes 
from the early Cambrian (approximately 515 million years old) 
Emu Bay Shale of South Australia, assigned to the Cambrian apex 
predator Anomalocaris. Their preserved visual surfaces are com- 
posed of at least 16,000 hexagonally packed ommatidial lenses (in a 
single eye), rivalling the most acute compound eyes in modern 
arthropods. The specimens show two distinct taphonomic modes, 
preserved as iron oxide (after pyrite) and calcium phosphate, 
demonstrating that disparate styles of early diagenetic mineraliza- 
tion can replicate the same type of extracellular tissue (that is, 
cuticle) within a single Burgess-Shale-type deposit. These fossils 
also provide compelling evidence for the arthropod affinities of 
anomalocaridids, push the origin of compound eyes deeper down 
the arthropod stem lineage, and indicate that the compound eye 
evolved before such features as a hardened exoskeleton. The 
inferred acuity of the anomalocaridid eye is consistent with other 
evidence that these animals were highly mobile visual predators in 
the water column**. The existence of large, macrophagous nektonic 
predators possessing sharp vision—such as Anomalocaris—within 
the early Cambrian ecosystem probably helped to accelerate the 
escalatory ‘arms race’ that began over half a billion years ago”*. 

Anomalocaridids are broadly acknowledged to be the top predators 
in Cambrian and Ordovician ecosystems®’. Predatory habits are 
inferred on the basis of their large size (body length more than 
91.5 cm, ref. 9), robust spinose frontal appendages, mouth with a dentate 
inner margin’, size and form of the midgut glands", predation damage 
to associated biota'®’* and interpretation of coprolites as having an 
anomalocaridid origin®'*. The streamlined profile of the body, inferred 
function of the lateral body flaps (‘swim flaps’) in locomotion, and the 
large tripartite tail fan preserved in a few taxa indicate strong swimming 
capabilities*'*'®. The large, stalked eyes of anomalocaridids, emerging 
from the dorsolateral side of the head, provide additional evidence for 
prey detection and tracking consistent with predatory habits. Although 
the eyes are preserved in several anomalocaridid genera’***'”"*, they 
have until now been known solely from their outlines. The inference 
that they were probably compound eyes’® has been based on size, shape 
and the phylogenetic placement of anomalocaridids in the arthropod 
stem group'*”° rather than on direct evidence of surface detail (such as 
preserved lenses). 

Exceptionally preserved eyes from the early Cambrian (Series 2, 
Stage 4) Emu Bay Shale’ at Buck Quarry, Kangaroo Island, South 


Australia, are much larger, and have very different morphology, than 
those of another arthropod documented from this locality’. The visual 
surface is pyriform (pear-shaped) rather than hemispherical, and the 
ommatidia are more than five times as numerous, yet much smaller in 
maximum size, with no evidence of a distinct ‘bright zone’. The 
specimens are registered in the collections of the South Australian 
Museum. SAM P45920a,b (part and counterpart) is a pair of eyes 
(Fig. 1) that are in contact with each other at their proximal ends, their 
long axes diverging from each other at 45 degrees. Both eyes are of the 
same size and morphology, and a chance association can be ruled out; 
they are identified as the left and right eyes of a single individual; 
although the eyes in most other anomalocaridids are widely 
spaced’®"*”8, the preserved orientation of the eyes in SAM P45920 
is similar to that of Amplectobelua symbrachiata from China (figure 
15A in ref. 15) and probably results from lateral or oblique compres- 
sion of the head. They are each composed of a pyriform visual surface 
with a very large number of small, hexagonal ommatidial lenses. In the 
more complete of the pair (Fig. 1d), the long axis of the visual surface is 
21.8 mm long, its maximum width 12.2 mm (including a fracture, 
Fig. 1b). Part of this eye is overprinted across its width by a different 
structure of undetermined nature (‘us’ in Fig. 1b, d). It is preserved in 
positive relief on the part, with its lenses concave. The other eye has a 
maximum width of 12.7 mm; it is preserved in negative relief on the 
part, and its lenses are convex. In the counterpart, the concavity/ 
convexity of the visual surface and lenses is reversed. We interpret 
the sediment in the proximal portion of this eye as having in-filled 
the collapsed cavity, and the cuticle is now replicated by iron oxide. 
Lenses range from ~70-110 jum in diameter, with variability throughout 
the visual field attributed to surface irregularities. The lenses are con- 
sistently arranged with regular hexagonal packing relative to their 
neighbours (Fig. 1c, e). Extrapolating from the regions with the best 
preserved lenses across the area of the entire visual surface of the eye in 
Fig. 1d indicates that an estimated 16,700 lenses are present on one side 
of the eye. The proximo-distal axis of the visual field is established on 
the basis of the preservation of the eye stalk on the left eye, with a sharp 
delineation from the visual surface (Fig. 1b, d). The eye stalk lacks 
obvious surface sculpture and is more strongly developed along one 
side of the visual surface than the other. A second specimen of a single 
eye, P46330a,b (Fig. 2b and Supplementary Fig. la-e), is even larger, 
16.9 mm wide, and has patches of well-preserved lenses, but is too 
incomplete to measure its length. 

Scanning electron microscopic energy dispersive spectrometry 
(SEM-EDS) analysis of SAM P45920a shows that the visual surface 
is composed of iron oxide with detectable traces of sulphur (Fig. 2a), 
probably indicating limonite after pyrite. Limonite pseudomorphs of 
microcrystalline pyrite are often found concentrated beneath and 
between non-trilobite arthropod cuticle in the Emu Bay Shale (figure 7 
in ref. 22), as are peripheral haloes of iron oxide around a variety of 
fossils, suggesting that pyritization was a common preservation mode 
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Figure 2 | SEM-EDS analyses of Anomalocaris eyes. a, SAM P45920a. Scale 
bar, 1 mm; see Fig. 1a for area analysed. b, SAM P46330b. Scale bar, 0.3 mm; see 
Supplementary Fig. le for area analysed. Accelerating voltage of 20kV. Each 
map depicts the relative abundance of each element, with brighter colours 
indicating greater abundance. SEM, backscattered electron image of area 
analysed. Al, aluminium (green); Ca, calcium (cyan); Fe, iron (red); O, oxygen 
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Figure 1 | Anomalocaris eyes from 
the Emu Bay Shale. a-d, Eye pair, 
SAM P45920a, level 10.4 m. 

a, b, Overview and camera lucida 
drawing. Scale bars, 5 mm. Grey fill 
in b represents visual surface, the 
proximal part in the upper eye 
extrapolated from the lower eye. 

c, Detail of ommatidial lenses located 
by horizontal white box in a. Scale 
bar, 1 mm. d, More complete eye, 
showing transition between visual 
surface and eye stalk (white arrows). 
Scale bar, 2mm. e, Detail of 
ommatidial lenses in counterpart 
SAM P45920b. Scale bar, 0.3 mm. es, 
eye stalk; L.c., Isoxys communis; us, 
undetermined structure; vs, visual 
surface. Tilted white box in 

a represents area analysed using 
SEM-EDS, with elemental maps 
shown in Fig. 2a. 


(dark blue); P, phosphorus (purple); S, sulphur (yellow); Si, silicon (pink). The 
visual surface of SAM P45920a in a contains elevated amounts of iron, oxygen 
and sulphur, indicative of limonite after pyrite; the matrix (at left of each image) 
shows high levels of silicon and aluminium, reflecting muscovite and 
aluminosilicate clay minerals. The lenses of SAM P46330b in b contain elevated 
amounts of calcium and phosphorus, indicative of calcium phosphate. 
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within this deposit. Moreover, Anomalocaris frontal appendages and 
lateral body flaps (Supplementary Fig. 1f) from the Emu Bay Shale are 
often preserved as iron oxide, suggesting that pyritization of the entire 
body was possible, as in the anomalocaridids from the Ordovician 
Fezouata Biota in Morocco’ and Schinderhannes bartelsi from the 
Devonian Hunsriick Slate in Germany". Early diagenetic pyritization 
of soft tissues is also common in the Chengjiang Biota and requires high 
concentrations of iron in pore waters and low organic content within the 
sediment to promote reactions between iron minerals and H,S generated 
by anaerobic bacterial sulphate reduction”’. These specific conditions 
seem to have been present during the deposition of the fossiliferous 
interval of the lower Emu Bay Shale, the mudstones being iron rich 
but extremely low in total organic carbon”. Interestingly, an SEM- 
EDS analysis of SAM P46330b (Fig. 2b) reveals that the lenses contain 
elevated amounts of calcium and phosphorus (and only minor traces of 
iron), suggesting that phosphatization (rather than pyritization) was the 
taphonomic pathway in this instance. The role of phosphatization was 
also important in preserving the eyes of a different arthropod in the 
Emu Bay Shale’, demonstrating that disparate styles of early diagenetic 
mineralization can replicate the same type of extracellular structure (in 
this case, the cuticle of the visual surface) within a single Burgess-Shale- 
type deposit. This contrasts with many other Cambrian Konservat- 
Lagerstatten, wherein recalcitrant tissues (such as cuticle) typically 
preserve as carbonaceous films** and may therefore explain why 
intricate details of the visual surface (especially in non-biomineralizing 
arthropods) are rarely preserved". 

These Emu Bay Shale eyes (Fig. 1 and Supplementary Fig. la—e) are 
identified as belonging to anomalocaridids. Two species of Anomalocaris 
occur in the Emu Bay Shale, A. briggsi Nedin, 1995, and Anomalocaris 
sp. nov.”*. Both taxa are present in Buck Quarry as well as at the locality 
on the coastline at Big Gully. Frontal appendages and lateral body flaps 
of Anomalocaris are present at the levels (10.4 and 11.3 m) from which 
the eyes were collected, and are common within a 2-m interval that 
includes these levels (Supplementary Figs 1f and 2). The large size of the 
eyes rules out an assignment to all other arthropods known from the 
Emu Bay Shale, but is consistent in size with the eyes of anomalocaridids 
known from articulated specimens from other Cambrian Konservat- 
Lagerstatten (Supplementary Table 1). The shape of the Emu Bay Shale 
eyes compares closely to that of Anomalocaris'*"’, Amplectobelua"’, 
Laggania’® and Hurdia’*, all being variably pyriform or ovoid; the 
length:width ratio (1.8:1) of the present elements falls well within 
the variation in anomalocaridids as a whole (1.3:1-2.0:1) (Supplemen- 
tary Table 1). The width of the eye stalk relative to the width of 
the distal pyriform section of the eye (that bears the visual surface in 
SAM P45920a) likewise resembles the stalked eyes of articulated 
anomalocaridids. Considering eye length in articulated specimens of 
Anomalocaris from Canada” and China’? relative to the length of the 
frontal appendages, the Emu Bay Shale eyes are of an appropriate size 
relative to the frontal appendages of A. briggsi. We thus identify the 
specimens as Anomalocaris, the only anomalocaridid genus known 
from the Emu Bay Shale. 

The number of ommatidia in the Anomalocaris eyes would almost 
certainly have greatly exceeded the count based on the exposed surface 
of the eye alone. If the flattened surface in the fossils is even partly 
mirrored on the other side, as suggested by the fact that the three 
available specimens each have lenses over the entire exposed area of 
the fossils, but factoring in possible asymmetry to provide cuticular 
support of the visual surface, the total count could be substantially 
greater than the observed 16,000+ lenses. If this is indeed the case, 
few living arthropods have as many ommatidia, and these eyes would 
certainly have functioned with a high degree of acuity”®. Assuming that 
SAM P46330 has the same proportions as the more complete SAM 
P45920, a length of the visual surface in excess of 3 cm is inferred, with 
the likelihood of an even greater ommatidial count than in SAM 
P45920. Throughout the geological history of Arthropoda, compound 
eyes have rarely exceeded this size; very large Siluro-Devonian pterygotid 
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eurypterids and some Jurassic thylacocephalans represent some of the 
rare examples with eyes larger than those of Anomalocaris. 

The discovery of compound eyes in Anomalocaris provides com- 
pelling support for arthropod affinities’**. Dense, hexagonal packing 
of ommatidia in compound eyes has been demonstrated to have been 
unequivocally present in Schinderhannes bartelsi, a Devonian species 
resolved as the immediate sister group to the arthropod crown group” 
(Fig. 3). The eyes of Schinderhannes resemble those of Anomalocaris in 
being large, stalked, having an ovoid outline of the visual surface, anda 
highly elevated number of lenses. The finding that Anomalocaris, 
resolved more basally than Schinderhannes in the arthropod stem 
group’’, possesses the same kind of ommatidial packing as in 
Schinderhannes and crown-group arthropods pushes the origin of 
compound eyes further down the arthropod stem group. As such, 
compound eyes evolved earlier than the origin of a hardened tergal 
exoskeleton and biramous trunk limbs (the latter characters being 
present in Schinderhannes but not anomalocaridids). We infer that 
the stalked eyes of all Radiodonta’’ (that is, anomalocaridids) are 
arthropod-type compound eyes. Previous inferences of this character 
distribution’® based on gross morphology (size and stalks) now have 
direct support. The mode of growth of the anomalocaridid eye is 
presumed to be the same as in other arthropods, in which new 
elements are added at the margins of the visual field*. The next most 
stem-ward (basal) taxon in the arthropod stem group’*”’, Opabinia, 
also has stalked eyes, but direct observation of ommatidia is needed to 
ascertain whether they are compound. 

The resolving power of compound eyes depends to a large extent on 
the angle between ommatidia; smaller angles mean denser image 
sampling and higher acuity. Although exact angles in the Anomalocaris 
eyes cannot be determined owing to extensive compression, upper limits 
on average angles can be estimated (see Supplementary Methods). 
Assuming the preserved visual surface does not curve more than 180°, 
average interommatidial angles would be <1.4°; even this conservative 
estimate suggests greater acuity than most living arthropods”. The 
interommatidial angle and average lens diameter of ~95 um yields a 
low ‘eye parameter’ value of <2, characteristic of diurnal taxa living in 
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Figure 3 | The early evolution of compound eyes, and the position of 
anomalocaridids (Radiodonta), in the arthropod stem group. Numbers 
refer to the inclusiveness of the monophyletic group that can be confidently 
inferred to possess compound eyes: (1) based on extant taxa alone; (2) based on 
discovery of Schinderhannes’”; and (3) based on new data herein. Phylogeny 
after ref. 19. 
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well-lit marine environments”, and is consistent with suggestions that 
anomalocaridids were midwater predators in the photic zone’. 

The very large size of anomalocaridid compound eyes and the visual 
acuity inferred from the elevated lens number and lowinterommatidial 
angles suggest that processing of visual information would have 
required the optic neuropils and brain to be of comparable complexity 
to crown-group (that is, modern) arthropods. In the crown group, two 
optic neuropils are reconstructed in the most recent common ancestor, 
transmitting to a protocerebrum with a median unpaired neuropil, 
the central body”. The eyes of Anomalocaris expand the known 
diversity of visual adaptations in the early Cambrian: low-resolution 
organs with <100 ommatidia (eodiscoid trilobites), higher-resolution 
eyes with a distinct bright zone that might have functioned in low 
light', and very large eyes with a uniformly dense visual field adapted 
to bright environments. 

Functional morphological arguments from the structure of raptorial 
frontal appendages and large, spherical eyes in various Cambrian 
arthropods indicate that visual predation in the water column was 
already established as an important component in early Cambrian 
food webs””*. The large absolute size and huge number of ommatidial 
lenses in the eyes of Anomalocaris confirm its status as a highly visual 
apex predator’. The presence of sophisticated nektonic predators with 
acute vision, such as anomalocaridids, within Cambrian communities 
would have placed considerable selective pressures on prey that would 
have influenced the ‘arms race’ that began during this important phase 
in early animal evolution’®. 
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Ecology drives a global network of gene exchange 
connecting the human microbiome 


Chris S. Smillie!*, Mark B. Smith?*, Jonathan Friedman, Otto X. Cordero®, Lawrence A. David* & Eric J. Alm**® 


Horizontal gene transfer (HGT), the acquisition of genetic material 
from non-parental lineages, is known to be important in bacterial 
evolution’’. In particular, HGT provides rapid access to genetic 
innovations, allowing traits such as virulence’, antibiotic resistance* 
and xenobiotic metabolism® to spread through the human micro- 
biome. Recent anecdotal studies providing snapshots of active gene 
flow on the human body have highlighted the need to determine the 
frequency of such recent transfers and the forces that govern these 
events**. Here we report the discovery and characterization of a 
vast, human-associated network of gene exchange, large enough to 
directly compare the principal forces shaping HGT. We show that 
this network of 10,770 unique, recently transferred (more than 99% 
nucleotide identity) genes found in 2,235 full bacterial genomes, is 
shaped principally by ecology rather than geography or phylogeny, 
with most gene exchange occurring between isolates from ecologi- 
cally similar, but geographically separated, environments. For 
example, we observe 25-fold more HGT between human-associated 
bacteria than among ecologically diverse non-human isolates 
(P= 3.0 X 10-7”). We show that within the human microbiome this 
ecological architecture continues across multiple spatial scales, func- 
tional classes and ecological niches with transfer further enriched 
among bacteria that inhabit the same body site, have the same oxygen 
tolerance or have the same ability to cause disease. This structure 
offers a window into the molecular traits that define ecological 
niches, insight that we use to uncover sources of antibiotic resistance 
and identify genes associated with the pathology of meningitis and 
other diseases. 

The human body is a complex biological network comprising ten 
microbes for each human cell and 100 microbial genes for each unique 
human gene’. Because this hidden microbial majority is known to have 
profound impacts on many aspects of human health including 
immunity’, inflammatory disease* and obesity’, considerable efforts 
are underway to document the genetic diversity of the human micro- 
biome. The role of HGT in the generation and distribution of this 
biochemical repertoire is unclear, although anecdotal findings suggest 
that it may be significant**’°. In addition to informing our under- 
standing of microbial evolution, predictive models of gene transfer 
are needed for the effective engineering of the human microbiome 
because HGT facilitates rapid adaptation to drugs and other perturba- 
tions**. Until now, however, a dearth of available genome sequences 
and appropriate analytical techniques have left an incomplete view of 
the forces that govern HGT". 

Many previous efforts to explore these forces have highlighted the 
relationship between phylogeny and HGT’'™. Phylogeny is expected 
to influence HGT strongly because shared evolutionary history is asso- 
ciated with overlap in the host range of mobile elements", establishing 
a mechanistic basis for the phylogenetic control of gene exchange. 
Meanwhile, upon transfer, selection favours the persistence of genes 


acquired from close relatives, because these genes have greater com- 
patibility with native molecular machinery'*’’. 

Geography might provide an alternative structure to HGT by 
restricting dispersal, as suggested by the geographically organized dis- 
tribution of Vibrio cholera integrons’? and NDM-1 antibiotic resist- 
ance genes’, 

A third possibility is that ecological similarity shapes networks of 
gene exchange by selecting for the transfer and proliferation of adapt- 
ive traits or by increasing physical interactions between community 
members. Reports of enriched levels of HGT between hyperthermo- 
philes’’ and spatially segregated exchange among Shewanella isolates” 
offer suggestive glimpses of such an ecological structure. However, it 
has been difficult to determine whether ecology has a broader function 
in HGT because of the limited availability of genomes from similar 
environments and because most previous work has ignored the dis- 
tinction between recent transfers and ancient events. The inclusion of 
transfers from millions or billions of years in the past can obscure 
ecological structure, because historical niches may not reflect modern 
environmental associations. 

To explore the effects of phylogeny, geography and ecology on HGT 
we use an evolutionary-rate heuristic to identify recent transfers between 
thousands of microbial genomes. Our heuristic finds blocks of nearly 
identical DNA (more than 500 nucleotides, more than 99% identity) in 
distantly related genomes (less than 97% 16S rRNA similarity). HGT is 
the best explanation for these observations because the highly conserved 
16S gene evolves about 25-fold more slowly than protein-coding 
synonymous sites*!. As a result, vertically inherited orthologues in 
such divergent genomes are nearly saturated with mutations at 
synonymous sites”, in contrast to the almost perfect identity that 
we require. To avoid overcounting transfers, we cluster similar 
genomes and normalize against the number of possible comparisons. 

We have confirmed that at least 98% of all HGT events identified 
with our approach include a predicted protein-coding gene, indicating 
that potentially problematic non-coding elements do not significantly 
affect our results. To validate our HGT detection method further, we 
use two phylogenetic inference methods to evaluate the evolutionary 
origins of putatively transferred sequences. Quartet mapping and a 
gene loss analysis each support 99% of identified HGTs (Supplemen- 
tary Fig. 1). 

As expected, a large fraction of observed transfers (27%) include at 
least one predicted mobile element, underscoring the importance of 
these genes in facilitating exchange. However, when we account for 
redundancies we find that mobile elements such as plasmids (2%), 
phages (1%) and transposons (9%) reflect only a promiscuous minority 
of the 10,770 total unique proteins that we observe, whereas the majority 
of unique genes (87%) provide other functions. 

Direct exchange between any two bacteria in our data set is unlikely, 
both because we limit our analysis to distantly related bacteria 
and because strains were isolated from different human subjects or 
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environments, often on different continents. An average pairwise dis- 
tance of 7,000 km separates bacteria engaging in HGT. Therefore, each 
observed HGT probably reflects two independent acquisitions from a 
shared pool of mobile DNA, followed by proliferation. 

To quantitatively explore the connectivity of bacteria in the human 
microbiome relative to other environments, we compare gene transfer 
between the 1,183 human-associated bacteria and 1,052 non-human- 
associated isolates from a broad range of aquatic, terrestrial and host- 
associated environments across the world. Even after correcting for 
biased sampling of human-associated clades (see Methods), pairs of 
bacteria isolated from the human body are 25-fold more likely to share 
transferred DNA than pairs from other environments (P = 3.0 x 10° 7”; 
combined Mann-Whitney U-test). 

This enrichment in human-associated transfer may be caused by the 
prevalence of overlapping selective pressures in the tightly regulated, 
endothermic human host in comparison with diverse, non-human 
environments that experience significant temporal and spatial vari- 
ation in selective pressures. Consistent with this hypothesis, when the 
environment is specified more precisely by focusing on human isolates 
from the same body site, we observe twofold higher rates of transfer 
(P=9.9 X10 '°*; combined Mann-Whitney U-test). Among the 
most closely related isolates from the same body site, this corresponds 
to recent HGT in more than 40% of comparisons. This elevated trans- 
fer between bacteria isolated from similar environments extends 
beyond the human body, with threefold more HGT between bacteria 
isolated from the same non-human environment relative to isolates 
from different non-human environments (P = 1.3 X 10 7!; combined 
Mann-Whitney U-test). 

However, an alternative explanation for these observations is that 
closely related bacteria colonize similar environments, creating an 
apparent ecological effect that is actually driven by shared evolutionary 
history. To control for such a phylogenetic effect, we plotted observed 
HGT over a range of phylogenetic divergences and found that the 
strong enrichment for exchange within similar environments (same 
host, same body site, same non-human environment) persisted across 
all distances (Fig. 1). 

For a direct comparison of the relative contributions of phylogeny 
and ecology to the enrichment in human-associated transfer, we com- 
puted recent HGT between bacteria isolated from the human body 
(same ecology) and between these human-associated bacteria and all 
non-human-associated isolates (different ecology) over a range of 
phylogenetic distances. As shown by the dashed line in Fig. 2a, even 
the most deeply divergent bacteria that are separated by billions of years 
of evolution but share the same ecology engaged in more HGT than the 
mostly closely related isolates with different ecology. Thus, this recent 
gene exchange is structured by ecology more than by phylogeny. 

We used a similar approach to explore the influence of geography 
relative to phylogeny and found that exchange between continents was 
slightly lower than exchange within the same continent (Fig. 2b; 
P=0.02; combined Mann-Whitney U-test). However, this geo- 
graphic effect was much weaker than that of phylogeny, which was 
itself less informative than ecology. Taken together, these analyses 
indicate that recent HGT frequently crosses continents and the Tree 
of Life to connect the human microbiome globally in an ecologically 
structured network. 

This ecological architecture might reflect only the especially pro- 
nounced ecological differences between human-associated and non- 
human-associated bacteria. To determine whether ecology has a broad 
influence on recent gene exchange we searched for enriched HGT in 
narrower spatial, functional, and niche resolutions within the human 
host. Across all of these dimensions ecology strongly predicted gene 
exchange. 

In addition to the previously discussed finding that transfer was 
enriched among bacteria from the same body site (Fig. 1), we found 
that further specifying the subsite of isolation (for example, separating 
vaginal isolates from other urogenital isolates) revealed even higher 


242 | NATURE | VOL 480 | 8 DECEMBER 2011 


—  Non-human 
—— Same environment 
24 — Different environment 
40- \ 
2 : 14 
fo) 
2 
& 30- | ‘ 
\ 4 
8 3 6 9 12 15 18 
\ 
° 
S \ 
= =| " ‘Y Human 
2 \ ——-— Same body site 
\ — — — Different body site 
10-1 . +, Non-human 
t+ 
04 a ee gece 


16S distance (%) 


Figure 1 | Recent HGT is enriched in the human microbiome across all 
phylogenetic distances. HGT frequency is plotted as a function of the 
phylogenetic divergence between species for human-associated bacteria (a) and 
non-human-associated bacteria (b). We define species as clusters of genomes 
separated by less than 2% 16S rRNA divergence. HGT frequency is calculated in 
bins of 1% 16S rRNA divergence. Error bars indicate s.d. (see Supplementary 
Methods), with sample sizes described in Supplementary Table 8. These trends 
are also observed after controlling for the potential effects of sequencing centre 
contamination (Supplementary Fig. 4) and cosmopolitan strains 
(Supplementary Fig. 6). 


levels of transfer across all three annotated body subsites (vagina, 
gingiva and nasopharynx) (Fig. 3a and Supplementary Figs 2 and 3; 
P=1.7 X10 *; combined Mann-Whitney U-test). When all human 
and non-human environments were considered, with scales ranging 
from tissues to ecosystems, we found that exchange at a narrow spatial 
scale, within an environment, always exceeded exchange at a broader 
spatial scale, with all other environments (Fig. 3b; P = 1.3 10 773; 
combined 7’). 

Up to this stage, our analysis relied on isolation environment as a 
proxy for ecological similarity, ignoring heterogeneities within these 
sites. Next we explored these differences, by evaluating the effects on 
HGT of oxygen tolerance and pathogenicity—the only other sufficiently 
annotated ecological features. Even after controlling for the effects of 
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Figure 2 | Ecology is the dominant force shaping recent HGT in the human 
microbiome. a, The frequency of HGT between human-associated isolates 
(same ecology; blue) and between human-associated and non-human- 
associated isolates (different ecology; red). b, The frequency of HGT between 
bacteria isolated from the same continent (blue) and different continents (red). 
As a result of a reduced sample size in b, we pooled comparisons into larger 
phylogenetic distance bins of 3%. Error bars were calculated as in Fig. 1. The 
role of ecology in a is recovered when we control for sequencing centre 
contamination (see Supplementary Fig. 5). 
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Figure 3 | HGT is ecologically structured by functional class and at multiple 
spatial scales. The frequency of transfer between different environments is 
shown for all functional groups (a, b) and for antibiotic resistance (AR) genes 
only (c, d). Box widths indicate the number of genomes from each 
environment. a, When all genes are considered (upper half), human isolates 
form a block of enrichment (upper left). b, For every environment examined we 
observe more transfer within the same environment (black dots) than between 


body site and phylogeny, we found that HGT was also structured by 
oxygen tolerance (Fig. 4a; P= 7.710 '*; 7’) and pathogenicity 
(Fig. 4b; P= 7.4 X 10"; y*). These findings demonstrate that in addi- 
tion to the extensive spatial effects described above, chemical gradients 
and symbiotic relationships provide further ecological structure to 
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Figure 4 | Gene exchange is ecologically structured by oxygen tolerance and 
pathogenicity. The frequency of HGT between genomes with the same oxygen 
tolerance (a) and pathogenicity (b) is shown (blue) relative to their expected 
values (red). Expected values are based on overall frequencies of transfer 
between bacteria from the same distribution of body sites and phylogenetic 
distances. Bacteria that share the same oxygen tolerance (aerobic, anaerobic, 
microaerophilic or facultative aerobic) and pathogenicity (pathogenic or 
commensal) engage in significantly more HGT than expected under the null 
model, in which these traits have no influence on HGT. Error bars were 
calculated as in Fig. 1. 


environments (white dots). c, The fraction of gene transfers that includes at 
least one AR gene for each environment. Statistical uncertainty in the 
proportion of AR transfer is indicated by decreased colour saturation (see 
Methods). d, AR genes comprise a significantly higher fraction of observed 
HGT between different environments (white dots) relative to that within the 
same environment (black dots) in contrast to b. Uro, urogenital. 


recent HGT. Because these results persisted after controlling for expli- 
cit spatial effects, they seem to reflect selection rather than simply co- 
occurrence. 

To further explore the role of selection, we probed its effects on the 
proliferation of different functional classes. If selection influences the 
rates and bounds of gene exchange, then the transfer of genes provid- 
ing a non-specific selective advantage, such as antibiotic resistance, 
should show reduced environmental specificity relative to other, more 
niche-specific, functional classes. To test this prediction, for each 
environment, we considered the fraction of observed transfers that 
included at least one antibiotic resistance gene (Fig. 3c). In contrast 
to our earlier observation of increased transfer within sites when all 
functional classes were grouped together (Fig. 3a, b), here we observed 
that resistance comprised a higher fraction of transfers across different 
environments than within the same environment (Fig. 3d; 
P=6.9 X10 *”; combined 7’). Thus, when ecological forces tran- 
scend environmental boundaries, mobile genes do too. 

We have explored networks of gene transfer to evaluate the forces 
that influence recent HGT, finding that ecology is profoundly import- 
ant. Next we demonstrated how knowledge of this association between 
ecology and HGT could be used to reveal clinical insights from patterns 
of observed gene transfer. 

Our findings, coupled with previous results’, suggest that recently 
transferred genes between bacteria occupying a well-defined niche are 
especially likely to reflect adaptation to that niche. Consistent with this 
expectation, we found that many genes transferred between distantly 
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related meningitis isolates—such as hemolysins, adhesins and antibiotic 
resistance genes (Supplementary Table 1)—are known to be important 
in the disease”. We suggest that other transferred genes with unknown 
functions are probably cryptic virulence factors and should be prior- 
itized for experimental annotation. Thus, in addition to recovering 
known virulence factors, our approach might streamline the search 
for novel drug targets”, because although it is prohibitively difficult to 
explore all 24,095 unique meningitis genes with unknown function, it 
is tractable to evaluate the 13 that were recently transferred. We used 
this approach to identify genes associated with other diseases (for 
example pneumonia and endocarditis; Supplementary Tables 2 and 
3) and environments (for example hot springs and soil; Supplementary 
Tables 4 and 5), opening a molecular window into the genetic traits that 
define ecological niches. 

As a second example, our analysis of recent HGT revealed potential 
sources of clinical antibiotic resistance. We found that bacteria from 
farm animals and human food were enriched in transfer of resistance 
with human-associated bacteria relative to other non-human-associated 
isolates (P = 1.7 X 10 |! and P=0.01, respectively; Mann-Whitney 
U-test). In all, 42 unique antibiotic resistance genes were transferred 
between human and farm isolates. These transferred genes comprised 
nine families, all of which included both genes known to provide res- 
istance to clinical antibiotics and genes known to confer resistance to 
agricultural drugs (see Supplementary Table 6). This suggests that 
livestock-associated bacteria can contribute to clinical resistance 
without directly infecting humans, because for these mobile traits, 
genes, not genomes, serve as the unit of evolution and proliferation. 
Moreover, we observed 43 unique antibiotic resistance genes crossing 
national borders, suggesting that because the human microbiome is 
globally connected, local contamination of the shared mobile gene pool 
can have significant transnational consequences. 

We have shown that ecology governs recent HGT and used this 
finding to reveal the key genes and networks of exchange that facilitate 
colonization, and occasionally exploitation, of the human host. In the 
future this approach could be extended to analyse bacterial genomes 
from individuals or groups of individuals that differ in diet, disease or 
descent to search for the microbial genes that relate to these human 
conditions. 


METHODS SUMMARY 


All 16S genes were identified with the GreenGenes database”. A total of 115 
genomes with spurious or truncated 16S sequences were excluded from our ana- 
lysis. We used BLAST (version 2.2.20) with default parameters” to calculate an all- 
against-all nucleotide alignment for 2,235 genomes downloaded from IMG”. We 
inferred HGT events from blocks of nearly identical DNA (more than 99% iden- 
tity, more than 500 bp) in distantly related genomes (less than 97% 16S rRNA 
similarity). To avoid overcounting events in ancestral lineages, we collapsed clo- 
sely related genomes by using average linkage clustering into groups (‘species’) 
with a 16S dissimilarity of 2%. For each pair of these clusters, we calculated the 
fraction of genome comparisons between clusters that shared at least one inferred 
HGT event. We summed this fraction over all pairs of clusters and normalized to 
the total number of comparisons, to calculate the HGT per 100 comparisons. 
Statistical tests of HGT enrichment were performed separately for each distance 
bin, then combined into a single p value with Fisher's method. We modelled 
antibiotic resistance transfer as a binomial random variable with parameter p 
and calculated a 95% confidence interval around our estimate of p. The size of 
this confidence interval, which is the statistical uncertainty of our estimate, was 
used to desaturate the colour of the heat map in Fig. 3c. To explore the effects of 
oxygen tolerance and pathogenicity on HGT, we used a a test to compare the 
observed frequency of HGT with the expected value given the distribution of body 
sites and phylogenetic divergences. Protein-coding regions were identified and 
annotated with BLASTX’* (Expect (E) value< 10 °°) and UBLAST”® (maxtar- 
gets = 100, Evalue < 10 °°) searches against the NCBI nr database. Unique genes 
reflect unique best BLAST hits to the database. Antibiotic resistance genes were 
annotated with the Antibiotic Resistance Genes Database’. Data sets are available 
from http://almlab.mit.edu/data. 
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Spontaneous epigenetic variation in the Arabidopsis 


thaliana methylome 


Claude Becker!™, Jorg Hagmann'*, Jonas Miiller', Daniel Koenig", 


Heritable epigenetic polymorphisms, such as differential cytosine 
methylation, can underlie phenotypic variation’. Moreover, wild 
strains of the plant Arabidopsis thaliana differ in many epialleles**, 
and these can influence the expression of nearby genes'”. However, 
to understand their role in evolution’, it is imperative to ascertain 
the emergence rate and stability of epialleles, including those that 
are not due to structural variation. We have compared genome- 
wide DNA methylation among 10 A. thaliana lines, derived 30 
generations ago from a common ancestor®. Epimutations at indi- 
vidual positions were easily detected, and close to 30,000 cytosines 
in each strain were differentially methylated. In contrast, larger 
regions of contiguous methylation were much more stable, and 
the frequency of changes was in the same low range as that of 
DNA mutations’. Like individual positions, the same regions were 
often affected by differential methylation in independent lines, 
with evidence for recurrent cycles of forward and reverse muta- 
tions. Transposable elements and short interfering RNAs have 
been causally linked to DNA methylation®. In agreement, differ- 
entially methylated sites were farther from transposable elements 
and showed less association with short interfering RNA expression 
than invariant positions. The biased distribution and frequent 
reversion of epimutations have important implications for the 
potential contribution of sequence-independent epialleles to plant 
evolution. 

Although there is no doubt that DNA sequence mutations are the 
primary raw material for evolutionary change, local DNA methylation 
variants with major effects on the expression of nearby genes can be 
inherited over many generations'”. However, such epialleles are not 
always as stable as the primary DNA sequence*”"'’. New sequencing 
technologies have recently enabled the direct determination of spon- 
taneous DNA mutation rates’’, and we have previously reported that A. 
thaliana experiences about one single-base-pair mutation per haploid 
genome and generation’. This analysis was based on a set of five muta- 
tion accumulation lines that had been derived from a single individual 
of the inbred strain used to produce the high-quality reference genome 
sequence for A. thaliana. These lines had been separately propagated in 
a common environment by single-seed descent for 30 generations®. We 
examined whole-genome cytosine methylation'*"* in these five lines 
plus five additional lines of the same population by Illumina sequen- 
cing. We interrogated two siblings each of the 31st generation with an 
average strand-specific coverage depth of 20X per individual; changes 
shared within a line should predominantly reflect differences that had 
accumulated by the 30th generation. Because seeds from the founders 
were no longer available, we compared the 31st generation individuals 
to two independent lines that had been propagated for only three 
generations (Supplementary Fig. 1). 

Out of all cytosine residues with high-quality sequencing support 
(see Supplementary Methods), on average 2.8 million were found to be 
methylated in each line (Supplementary Table 1). The higher genome- 
wide methylation rate in our analysis compared to previous studies'*"* 


Oliver Stegle”, Karsten Borgwardt” & Detlef Weigel’ 


reflects the greater statistical power afforded by increased sequencing 
depth. We subsequently evaluated 13.9 million cytosines that had at 
least threefold coverage in all individuals, of which 3 million were 
methylated in at least one strain. Using Fisher’s exact test, we identified 
about 186,000 (6.2%) positions with a significant change in methyla- 
tion (false discovery rate <0.05) between at least one 31st generation 
and both 3rd generation lines. Almost all, 99.6%, of these differentially 
methylated positions (DMPs) were also detected with an entropy- 
based method"’. Given the limited statistical power for weakly methy- 
lated or poorly covered sites (Supplementary Figs 2-4), our DMP 
estimate would almost certainly increase with higher sequencing 
depth. For further analyses, we considered sites that agreed between 
31st generation siblings (on average, 99.8%) and between the two 
strains closest to the founder generation (99.7%). 

CG sites were highly over-represented among DMPs (Fig. 1a). This 
is unlikely to reflect greater instability of CG compared to CHG and 
CHH positions (where H is A, T or C), but rather higher statistical 
power in detecting a change at CG sites, which are on average much 
more highly methylated’** (Supplementary Fig. 4). Among CG sites 
in genic regions, including those producing non-coding RNAs, relative 
abundance of DMPs was two- to fourfold higher compared with non- 
differentially methylated positions (N-DMPs). The opposite was the 
case for CG positions in transposable elements and intergenic regions, 
with a similar, but less pronounced, bias for CHG and CHH sites 
(Fig. 1b). These observations were in agreement with CG-DMPs being 
found most often on chromosome arms, which have the highest gene 
density (Fig. 1c), even though cytosine methylation near the centro- 
meres is the highest'*'*. Gene body methylation gradually increases 
towards the 3’ end, before sharply decreasing at the end of the last 
exon’*"*'®17, although genes 1 kb or less in length were generally only 
weakly methylated (Supplementary Fig. 5a). The profiles of DMPs and 
N-DMPs were similar across individual genes, exons, introns and 
transposable elements (Fig. 1d and Supplementary Fig. 5b, c), but 
DMPs were less frequent in promoter and downstream regions. 
Notably, CG-DMPs accounted for 42% of methylated sites in gene 
bodies, despite all CG-N-DMPs outnumbering CG-DMPs four to one. 

Twenty-four-nucleotide-long small interfering RNAs (siRNAs) are 
important in maintaining DNA methylation’, and N-DMPs coincided 
seven times more often than DMPs with sites to which 24-nt siRNAs 
mapped'*. N-DMPs were also on average only half as far from such 
sites as DMPs (P<2.2 x 10 '°) (Fig. le and Supplementary Fig. 6a). 
siRNAs are enriched in and around transposable elements’’. In agree- 
ment, the average distance to the closest transposable element was 
much shorter for N-DMPs outside of transposable elements, com- 
pared to DMPs (P<2.2X 10 '°), even when only considering 
those in the centromere-distant regions of each chromosome, which 
contain relatively few transposable elements (Fig. le and Supplemen- 
tary Fig. 6b, c). 

A first major insight from our analyses is that transgenerational 
maintenance of CG methylation in transposable elements is apparently 
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Figure 1 | Genome-wide distribution of methylation polymorphisms. 

a, Contribution of CG, CHH and CHG sites to total and differential cytosine 
methylation. 32.8% of all CG, 15.7% of CHG and 4.6% of CHH sites, adding up 
to 10.8% of all cytosines, showed evidence of methylation. b, Distribution of 
DMPs and N-DMPs according to local annotation. CDS, coding sequence; 
ncRNA, non-coding RNA; TE, transposable element. c, Distribution of CG-N- 
DMPs and CG-DMPs along each chromosome. Data were normalized to the 


much more stable than CG methylation of protein-coding genes, con- 
sistent with DNA methylation being more important for controlling 
the activity of transposable element compared to the latter’*’””°*". This 
also agrees with a report that genic methylation is much more variable 
between wild strains of A. thaliana than methylation of transposable 
elements’. 

Hierarchical clustering based on DMPs grouped siblings as well as 
3rd and 31st generation lines together. An arbitrary selection of methy- 
lated positions, which included about 6% DMPs, produced a similar 
pattern; however, with N-DMPs only, clusters became much more 
random (Fig. 2a). These observations indicate that our DMPs capture 
most of the methylation differences between lines. We next calculated 
the pairwise distance between strains based on DMPs (Fig. 2b). 
Correlation was highest between the two 3rd generation strains, and 
each individual of the 31st generation was more similar to these two 
lines, from which they were separated by 34 generations, than to the 
other lines from the 31st generation, from which they had diverged 
for 62 generations. Taken together, we conclude that whole-genome 
methylation patterns are largely stable and therefore heritable in 
A. thaliana, but that differences in methylation status accumulate 
gradually, similar to genetic mutations. 

One strain, 69, was exceptional and had 40% more DMPs in com- 
parison with the 3rd generation than the other 31st generation lines 
(Fig. 2b). To determine whether this strain might have a defect in the 
methylation machinery, we sequenced its genome with more and longer 
reads compared to our previous analysis’. We found a non-synonymous 
change in MATERNAL EFFECT EMBRYO ARREST 57 (MEE57), which 
encodes a protein related to METHYLTRANSFERASE 1 (MET1) 
(Supplementary Fig. 7). MEE57 has been reported as essential for endo- 
sperm development”, although several A. thaliana strains lack func- 
tional MEE57 copies”. Thus, whether the MEE57 mutation contributes 
to the increased DMP number in line 69 remains unclear. The fact that 
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highest value for each chromosome and class. d, Averaged distribution of all 
methylated sites (5mC) and methylated CG and CHG sites along genes. Data 
were normalized to the highest value for each sequence context and class. The 
coding region is indicated by a black bar. e, Distance of DMPs and N-DMPs to 
the closest upstream and downstream 24-nucleotide siRNA and transposable 
element. Horizontal bar corresponds to median, whiskers indicate entire 75th 
percentile. 


the siblings of this line were as similar to each other as other sibling 
pairs (Fig. 2a) argues against a generally increased epimutation rate. 

Compared to genetic mutations, the frequency of epimutations at 
single cytosine residues was many orders of magnitude higher, with an 
average of close to 30,000 DMPs in the analysed sequence space, 
compared with less than 30 DNA sequence mutations per strain’. 
Thirty-two per cent of DMPs between generations 3 and 31 occurred 
more than once, and 13% more than twice (Fig. 2c). If DMPs arose 
randomly, we would expect less than 1% of recurrent events. That we 
observe many more indicates that certain positions are particularly 
prone to increases or decreases in methylation rate. To investigate 
directly how many DMPs emerge from one generation to the next, 
we analysed the 32nd generation of lines 39 and 49. These individuals 
were progeny of siblings of the individuals interrogated in the 31st 
generation, and shared changes in the 32nd generation should reflect 
differences that arose between the 30th and 31st generation. We found 
on average over 3,300 between-generation DMPs. This is in the same 
range as DMPs between siblings (on average, about 5,000), but more 
than we would have expected from the 30,000 that had accumulated 
between the 3rd and each of the 31st generation lines. One explanation 
is that frequent transgenerational changes in methylation status occur 
at a limited number of sites, and that only a fraction of new DMPs is 
maintained over the longer term. This is corroborated by the obser- 
vation that more than two-thirds of DMPs distinguishing the 32nd 
from the 31st generation in lines 39 and 49 had already been found in 
other 31st generation individuals. 

DNA methylation is known to occur nonrandomly, and to cluster in 
specific segments of the genome”!””*. We identified 249 differentially 
methylated regions (DMRs) that were at least 50 bp long (median 
100 bp, maximum 650 bp) (Supplementary Table 2 and Supplemen- 
tary Methods). Although probably a conservative estimate, the number 
of DMRs per line is in the same range as the DNA sequence mutations, 
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Figure 2 | Epigenetic diversity in the analysed population. a, Hierarchical 
clustering based on 20,000 sites each, drawn randomly from DMPs identified in 
pairwise comparison between all strains, cytosines methylated in at least one of 
the analysed strains (including about 6% DMPs), or N-DMPs. b, Heat map 
representing pairwise Pearson’s correlation coefficient (PCC) between 


less than 30 per line’. As with CG-DMPs, DMRs preferentially loca- 
lized to genes (Fig. 3a). DMRs did not overlap with known DNA 
mutations in these strains’. Similarly, structural variant discovery with 
established methods**** did not reveal evidence for DMRs being due to 
gross DNA lesions. The frequency of DMRs along genes was similar to 
the overall distribution of methylated cytosines, and was reminiscent of 
the pattern of variation seen in wild strains of A. thaliana* (Supplemen- 
tary Fig. 8). There were almost ten times as many DMRs in exons as in 
introns (Fig. 3a). Because exon-specific methylation may influence 
RNA splicing patterns*®’’, this could also be a source of variation in 
gene activity. Hierarchical clustering according to DMRs separated 
early- and late-generation strains into distinct groups (Fig. 3b). 
Notably, if we consider the methylation status in the 3rd generation 
individuals as largely reflecting the ancestral pattern, similar fractions of 
DMBs had lost or gained methylation by the 31st generation (Fig. 3c). 

Similarly to DMPs, recurrent events constituted more than one- 
third of all DMRs, indicating that the affected genomic regions were 
privileged sites of change (Fig. 3d, e). In addition, comparison of 
generations 32 and 31 identified four short DMRs per line, with re- 
methylation of one segment that had become unmethylated in 
generation 31 (Supplementary Fig. 9). Together, these observations 
demonstrate that large changes in methylation, although rare, can 
occur even within a single generation. 

Differences in promoter and genic DNA methylation can affect 
RNA levels'®. We compared the transcriptomes of two randomly 
selected strains of the 31st generation with the 3rd generation strains 
by RNA-seq (Supplementary Fig. 10) and identified 320 differentially 
expressed genes in pairwise comparisons between strains (Fig. 3f and 
Supplementary Table 3). The two 31st generation lines were separated 
from each other by the most changes, and the two 3rd generation lines 
by the fewest. Seven differentially expressed genes overlapped with a 
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individuals, considering all 250,000 DMPs identified between all strains. PCCs 
between 3rd generation strains, 0.92; between 3rd and 31st generation, 0.63- 
0.77; between 31st generation lines, 0.52-0.66. The histogram on top of the 
colour key indicates counts of PCC bins. c, Epiallele frequency of DMPs in the 
31st generation. 
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DMR in these strains (Fig. 3g). For the three genes with the highest 
difference in expression level and overlapping with the most cons- 
picuous DMRs, we observed a negative correlation between DNA 
methylation and gene expression. The remaining four genes over- 
lapped with much shorter DMRs and no correlation was apparent 
(Fig. 3e, g and Supplementary Fig. 11). 

We have presented a high-resolution analysis of transgenerational 
variation in DNA methylation of A. thaliana. The molecular mechan- 
isms underlying these changes remain elusive, but siRNAs, which map 
very often in or near transposable elements’’, probably have a role in 
stabilizing DNA methylation, which is corroborated by our finding 
that DMPs tend to be farther from transposable elements and to be 
associated with lower local siRNA activity than N-DMPs. These obser- 
vations indicate that the density and distribution of transposable ele- 
ments, which can differ greatly even between closely related species”, 
affect epigenetic variation throughout the genome. In the material 
analysed here, there was no evidence for DNA mutations acting in 
cis as an important cause of DMRs, although we cannot rule out that a 
non-synonymous mutation ina METI homologue might contribute to 
increased variation in DNA methylation in trans in one of the lines. 

In contrast to the high frequency of single-nucleotide methylation 
polymorphisms, larger regions appear to change methylation status at 
a rate that is comparable to genetic mutations. On the basis of previous 
work’, it is conceivable that the emergence of DMRs requires specific 
structural features such as nearby repeats. Although DMRs are rare, we 
found evidence for DMRs affecting gene expression, indicating that 
natural, sequence-independent epialleles could potentially contribute 
to phenotypic diversity. There are subtle morphological differences 
between the mutation accumulation lines®, and quantitative genetic 
approaches could be used to link specific DNA mutations or DMRs 
with such traits. How many of the methylation differences found 
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between wild strains* are due to sequence-independent changes versus 
ones driven by transposable elements and other structural variants is 
an important area for further investigation. In addition, it will be 
necessary to follow DNA methylation not only under benign green- 
house conditions, but also in the much more variable and stressful 
natural environment. 

Perhaps our most important finding is that the number of epimuta- 
tions does not increase linearly with time, indicating that many are not 
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Figure 3 | Differentially methylated regions (DMRs). a, Distribution of 
DMRs according to local annotation. Inter, intergenic; pseudo, pseudogene. 
b, Hierarchical clustering of individuals from the 3rd and 31st generation based 
on methylated sites in DMRs, ranked according to their position in the genome. 
Note the shared methylation differences across lines, but strict pairing of 
siblings. c, Regions with losses and gains of methylation in 1, 2 or 3 strains in 
generation 31 compared to the 3rd generation strains. d, Epiallele frequency of 
DMRs. e, DMR at At3g01345 (Chr3:129,159-130,670) across all strains. 
Methylation on both strands is indicated for each strain. Colours indicate 
methylated reads (red, CG; blue, CHG; yellow, CHH). Grey indicates reads 
supporting non-methylation. For simplicity, only one sibling is shown per 
strain. f, Differentially expressed genes in comparisons between 3rd and 31st 
generation strains. g, Average RNA expression levels of the genes overlapping 
with the regions in e and in Supplementary Fig. 11. Dots indicate values of 
individual samples. 


stably inherited over the long term. In addition to DMPs and DMRs 
that arose apparently independently in several strains, we even dis- 
covered a DMR that had become demethylated after 31 generations, 
but was re-methylated in the following generation. This suggests that 
DNA methylation in specific regions of the genome can fluctuate over 
relatively short timescales. Such sites can be considered as going 
through recurrent cycles of forward and reverse epimutation, which 
is very different from what is found at the level of the genome sequence, 
where reverse mutations are exceedingly rare. Importantly, reversion 
rates directly determine the ability of any type of allele to be subject to 
Darwinian selection. This needs to be taken into account when con- 
sidering the potential of epialleles as a factor in evolution’. 


METHODS SUMMARY 


Methylome sequencing. DNA was prepared from nuclei isolated from leaf tissue, 
bisulphite treated using a modification of a published protocol", and paired-end 
sequenced on the Illumina GAIIx platform. After image analysis and base calling 
with the Illumina pipeline, reads were processed using SHORE”, and aligned to 
the Col-0 reference genome with GenomeMapper”, adapted to the analysis of 
bisulphite sequencing data. Bisulphite conversion rates, as determined from 
unmethylated chloroplast and spiked-in lambda phage DNA, were 99.72% to 
99.84%. 

Analysis of methylated positions. Single sites were classified as methylated or 
unmethylated by fitting a binomial model based on reads falsely reporting methy- 
lation on the unmethylated plastid genome. We only considered cytosine residues 
covered by at least three independent high-quality base calls in all strains. For the 
determination of significant differences in methylation across strains on single 
sites or regions, we used Fisher’s exact test. 

Data visualization. A Gbrowse instance of the methylation profiles is available at 
http://gbrowse.weigelworld.org/fgb2/gbrowse/ath_methyl_ma. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Plant growth and material. Seeds were derived from Arabidopsis thaliana 
Columbia-0 lines in generation 3 (lines 4 and 8), generation 31 (lines 29, 39, 49, 
59, 69, 79, 89, 99, 109 and 119) and generation 32 (lines 39 and 49), counting from 
the founders, as described by Shaw and colleagues® (Supplementary Fig. 1). Plants 
were grown on soil under long-day conditions (23 °C, 16 h light, 8 h dark) after 
seeds had been stratified in 150 nM GA-supplemented water at 4°C for 6 days. 
Siblings were grown independently at different time points. Positions of the pots 
were randomized. 

Nucleic acid extraction. DNA was extracted from rosettes of individual 21-day- 
old plants. Plant material was flash-frozen in liquid nitrogen and ground in a 
mortar. The ground tissue was re-suspended in nuclei extraction buffer (10 mM 
Tris-HCl pH 9.5, 10 mM EDTA, 100mM KCL, 0.5 M sucrose, 0.1 mM spermine, 
0.4mM spermidine, 0.1% B-mercaptoethanol). After cell lysis in nuclei extraction 
buffer with 10% Triton X-100, nuclei were pelleted by centrifugation at 2,000g for 
120s. Genomic DNA was extracted using the Qiagen Plant DNeasy kit (Qiagen). 
Total RNA was extracted from rosette leaves of individual plants using the Trizol 
(Invitrogen) method according to the manufacturer’s instructions. Residual DNA 
was eliminated by DNase I (Thermo Fisher Scientific) treatment. 

Library preparation. Preparation of DNA libraries for genomic sequencing was 
done using the NEBNext DNA Sample Prep Reagent Set 1 (New England Biolabs), 
following the Illumina Genomic Sample Prep Guide (Illumina). 500-1,000 ng 
genomic DNA was fragmented to 300 bp average size with a Covaris S2 instrument 
using the following settings for 120 s in frequency sweeping mode: intensity 5, duty 
cycle 10%, 200 cycles per burst. DNA was purified on Qiaquick PCR purification 
columns. Preparation of DNA libraries for bisulphite sequencing was adapted 
from ref. 14. Input DNA was fragmented as described above. Libraries were con- 
structed using the NEBNext DNA Sample Prep Reagent Set 1 (New England 
Biolabs) according to the Illumina Genomic Sample Prep Guide with the following 
modifications. We used the Illumina Early Access Methylation Adapter Oligo Mix 
(catalogue number ME-100-0010). After size selection, the non-methylated cyto- 
sine residues were converted to uracil using the EpiTect Plus DNA Bisulfite kit 
(Qiagen) according to the manufacturer’s guidelines. For higher conversion effi- 
ciency the bisulphite incubation was repeated. Library enrichment was performed 
with Pfu Cx HotStart Polymerase (Agilent) and 18 PCR cycles. Libraries for RNA 
sequencing were prepared from 4 j1g of total RNA using the Illumina Truseq RNA 
sample prep kit B according to the manufacturer’s protocol. 

Sequencing. All sequencing was performed on an Illumina GAIIx instrument. 
Genomic and bisulphite-converted libraries were sequenced with 2 X 101-bp 
paired-end reads. For bisulphite sequencing, conventional A. thaliana DNA geno- 
mic libraries were analysed in control lanes. Transcriptome libraries were 
sequenced with 101-bp single end reads, with three libraries with different index- 
ing adapters pooled in one lane; no control lane was used. For image analysis and 
base calling, we used the Illumina OLB software version 1.8. 

Processing and alignment of bisulphite-treated reads. The SHORE pipeline” 
was used to trim and quality-filter the reads. Its default parameters were applied for 
the filtering step: reads with more than 2 (or 5) bases in the first 12 (or 25) positions 
with a quality score less than 3 were discarded. Reads were trimmed to the right- 
most occurrence of two adjacent bases with quality values equal to or greater than 5. 
Trimmed reads shorter than 50 bases were discarded. The remaining high quality 
sequences (on average 82% of raw reads across the sequenced strains) were aligned 
against the Arabidopsis thaliana genome sequence version TAIR9 (http://www. 
arabidopsis.org/portals/genAnnotation/gene_structural_annotation/annotation_ 
datajsp) using a modified version of the mapping tool GenomeMapper” that 
supports the alignment of bisulphite converted reads. Bisulphite converts non- 
methylated cytosines into uracils, which are propagated as adenine-thymine base 
pairs after PCR amplification. GenomeMapper tolerates asymmetrical T-to-C or 
A-to-G mismatches (read base against reference base) and can distinguish between 
reads from the bisulphite-converted strand ofa DNA fragment and sequences from 
its complementary amplified strand, if the reads have been obtained by paired-end 
sequencing. Only the read from the strand with converted Cs is informative about 
the methylation status of the underlying cytosine site. We allowed for up to 10% 
single-base-pair substitutions along the read length in the alignment process for 
each read to retain most coverage. GenomeMapper reports all alignments with the 
least amount of mismatches for each read. However, only reads mapping uniquely 
to a single position were used for this study. Furthermore, all but one read were 
removed from further analysis if their 5’ ends aligned to the same genomic position, 
to account for amplification biases. A paired-end correction method” was used to 
discard repetitive reads by comparing the distance between reads and their partner 
to the average distance between all read pairs. Reads with abnormal distances were 
removed if there was at least one other alignment of this read in a concordant 
distance to its partner. Finally, read counts on all cytosine sites were obtained with 
SHORE. The ‘scoring matrix approach’ of SHORE” assigns a score to each site by 


testing against different sequence and alignment related features. The criteria and 
complete scoring matrix can be found in Supplementary Table 4. For comparisons 
across lines, cytosines were accepted if at most one intermediate penalty on its score 
was applicable to at least one strain (score =32). In this case, the threshold for the 
other strains was lowered, accepting at most one high penalty (score = 15). In this 
way, information from other strains is used to assess sites from the focal strain 
under the assumption of mostly conserved methylation patterns, allowing the 
analysis of additional sites. The methylation statistics on each single strain assumed 
a quality score of 25 or higher, which means no more than two intermediate 
penalties. 

Determination of methylated sites. Sequencing errors, noise and imperfections 
of the bisulphite conversion contribute to the occurrence of sites that appear to be 
weakly methylated. Reads mapping against the non-methylated chloroplast 
sequence allow for objective estimation of the effective background rate of false- 
positive methylation detection. For this purpose, we fitted an independent 
binomial model to the relative proportions of converted and unconverted reads 
that cover cytosines in the chloroplasts. We estimated the binomial rate of false- 
positive methylation from the maximum likelihood estimate, separately for each 
library and for different bins of total read coverage: 


?= argmax,II Binomial(7,,(n, +7.)|r)- 


Here, n, and 7, denote the number of converted and unconverted reads from 

the considered cytosine sites. Supplementary Fig. 12a shows the obtained back- 
ground methylation rate for a single strain, line 30-39, as a function of the total 
read coverage per site. The overall false methylation rate when combining read 
data across the range of read coverage was 0.22%, which deviates significantly from 
higher error estimates when considering low-coverage regions in isolation. To 
account for the variability in error rates in the downstream analyses, we used 
specific error models for each strain and for read-coverage bins of multiples of 
fivefold, yielding error rates between 0.2% and 5.0% (Supplementary Fig. 12b). For 
coverage bins with too few sites for robust statistical estimation (<50), we imputed 
the false methylation rate from the closest sufficiently populated coverage bin. 
Given the estimated rates for false methylation, we carried out a genome-wide test 
for significant methylation of cytosines. For each site, we calculated the P value 
under the background model. We then used Storey’s method*', an extension of the 
Benjamini-Hochberg stepdown procedure, to assess genome-wide significance 
using q values. At a joint false discovery rate (FDR) of 5% we found between 
2,316,966 and 3,458,949 methylated sites in each strain (Supplementary Table 
1). When reducing FDR to 0.1%, we still retained almost 85% of the methylated 
sites, showing that the number of sites with weak methylation evidence was low. 
For analysis of methylated sites reported in this study, an FDR of 5% was deemed 
to be acceptable. 
Identification of differentially methylated positions. From the 13.9 million 
cytosines for which we had at least three independent high-quality reads in each 
strain, we selected sites that showed significant methylation in at least one strain, 
resulting in 3,067,017 positions. Sites with statistically significant methylation 
differences were identified with Fisher’s exact test. P values from individual tests 
per site were combined into single P values via conservative Bonferroni correction. 
Genome-wide FDRs were then estimated using Storey’s method’’. To limit false- 
positive DMPs, we first identified 69,583 DMPs between siblings at a relaxed FDR 
of 10%. These sites were excluded as were 8,893 DMPs distinguishing the two 3rd 
generation strains. This left 2,988,541 positions as the final set to test for differ- 
ential methylation between generations. Twenty pairwise tests of each of the ten 
31st generation strains against both 3rd generation strains were conducted on sites 
consistently methylated between 31st generation siblings and in the 3rd genera- 
tion. At an FDR of 5%, this yielded 186,248 DMPs. DMP allele frequency was 
obtained by progressively removing the strain with the lowest q value and correct- 
ing the remaining P values for multiple testing by the methods described above. 

We applied the same strategy to identify DMPs that differed either between the 
31st and 3rd generation, or between 31st generation strains. Count data from 
replicates were combined for each site, followed by pairwise Fisher’s exact tests 
between all combinations of strains (66 tests). We estimated P values for at least 
one differential pair using a Bonferroni correction, followed by Storey’s method"! 
to assess genome-wide significance. At a joint FDR of 5%, this identified 253,546 
DMPs across all 12 strains. 

Assessing statistical power. Two main factors influence the power to detect 
methylation differences: the number of statistical tests and local read coverage. 
To assess the impact of multiple testing, we applied the approach described above 
to all sites with at least threefold coverage in at least 12 of 24 individuals examined. 
Of 25.3 million such positions, 4,547,568 were found to be methylated in at least 
one of the lines, compared to 3,067,017 out of 13.9 million positions when con- 
sidering only sites with complete information. The number of sites assessed as 
methylated thus increased roughly linearly with the number of tested sites, as did 
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the number of differentially methylated positions. Similarly, the fraction of DMPs 
shared in more than one 31st generation strain, ~31%, was very similar to the 
~32% found among sites with complete information. We conclude that our 
method is largely insensitive to the number of tests performed. 

To assess the effect of read coverage, we determined how many DMPs could be 
identified after subsampling at 25%, 50% and 75% of total coverage. We identified 
almost twice as many DMPs with 50% compared to 25% coverage, but only 13% 
additional DMPs were identified when increasing coverage from 75% to 100% 
(Supplementary Fig. 13). Although not yet asymptotic, we estimate that the false- 
negative rate is well below 50%, and most likely closer to 10%. 

Identification of differentially methylated regions. The ~186,000 DMPs dis- 
tinguishing 31st from both 3rd generation lines were consolidated into regions of 
adjacent DMPs for each strain, with a maximum distance of 50 bp between DMPs. 
We then used Fisher’s exact test on the sum of methylated and unmethylated reads 
in both siblings, averaged across positions within the region. Resulting P values 
were corrected with Storey’s method*! and an FDR of 5% was accepted. 
Statistically significant regions from different strains were merged if they over- 
lapped by at least 20% of their combined length and if the methylation change was 
in the same direction compared to the 3rd generation lines. Short regions contain- 
ing only a small number of strongly differential sites were excluded by requiring 
DMRs to have a minimum length of 50 bp and to contain at least ten methylated 
positions and at least five DMPs. 

Mapping of DMPs, N-DMPs and DMRs to genomic elements. We used the 
TAIR10 annotation (http://www.arabidopsis.org/portals/genAnnotation/gene_ 
structural_annotation/annotation_data.jsp) to determine overlap of genes, pseudo- 
genes and transposable elements with methylated positions. We defined intergenic 
regions as regions that did not correspond to any annotated feature. A DMR was 
considered mapping to a particular genomic element if it overlapped with such an 
element for more than 20% of its length. 

Assessing the distance to the closest siRNA or transposable element. We 
determined the distance between a methylated position and the closest upstream 
and downstream siRNA using a published data set for A. thaliana aerial tissue’ 
(NCBI GEO accession number GSM518432). For transposable elements, we used 
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the TAIR10 annotation. Statistical significance was tested with a two-sided, 
unpaired Student’s t-test on the measured distances. Pericentromeric regions were 
defined as described”. 

Analysis of gene expression. DNA sequences resulting from RNA library pre- 
paration were barcode-sorted and quality filtered in SHORE”, and aligned using 
BWA” to the TAIR1O gene annotation. Reads were filtered for duplicates and 
required to have a mapping quality of at least 37. The remaining mappings were 
used to generate gene-level counts for expression analysis. We only considered 
genes for which the total counts in all samples combined exceeded 30. Between- 
sample expression correlations and strain-distribution plots were used for quality 
control to identify poor samples, and pairwise comparisons of expression were 
performed using the DEseq package* implemented in R. Differentially expressed 
genes were identified by a combination of per-gene variance (P= 0.01, with 
Benjamini and Yekutieli correction®) and common variance (=2X change). 
Density of genes without expression support were plotted along chromo- 
somes in sliding windows considering only genes with at least 50 methylated or 
unmethylated calls. Windows of gene methylation were calculated for entire gene 
bodies as the fraction of methylated positions (methylated in any sample) divided 
by the total number of called positions. Very similar results were obtained con- 
sidering sites methylated in all samples. Data were visualized with the ggplots 
package in R*. 

Data visualization. A Gbrowse instance of the methylation profiles is available at 
http://gbrowse.weigelworld.org/fgb2/gbrowse/ath_methyl_ma. 
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Predicting mutation outcome from early stochastic 
variation in genetic interaction partners 


Alejandro Burga', M. Olivia Casanueva! & Ben Lehner)? 


Many mutations, including those that cause disease, only have a 
detrimental effect in a subset of individuals. The reasons for this 
are usually unknown, but may include additional genetic variation 
and environmental risk factors’. However, phenotypic discordance 
remains even in the absence of genetic variation, for example 
between monozygotic twins’, and incomplete penetrance of muta- 
tions is frequent in isogenic model organisms in homogeneous 
environments**. Here we propose a model for incomplete penetrance 
based on genetic interaction networks**. Using Caenorhabditis 
elegans as a model system, we identify two compensation mechan- 
isms that vary among individuals and influence mutation outcome. 
First, feedback induction of an ancestral gene duplicate differs across 
individuals, with high expression masking the effects of a mutation. 
This supports the hypothesis that redundancy is maintained in 
genomes to buffer stochastic developmental failure’. Second, during 
normal embryonic development we find that there is substantial 
variation in the induction of molecular chaperones such as Hsp90 
(DAF-21). Chaperones act as promiscuous buffers of genetic vari- 
ation®”, and embryos with stronger induction of Hsp90 are less likely 
to be affected by an inherited mutation. Simultaneously quantifying 
the variation in these two independent responses allows the pheno- 
typic outcome of a mutation to be more accurately predicted in 
individuals. Our model and methodology provide a framework for 
dissecting the causes of incomplete penetrance. Further, the results 
establish that inter-individual variation in both specific and more 
general buffering systems combine to determine the outcome 
inherited mutations in each individual. 

It is well established that isogenic individuals show substantial vari- 
ation at the molecular level, for example in the expression levels of 
particular genes'®. This variation can be an important influence on 
signalling and development in wild-type (WT) individuals'’"’. Such 
molecular variation may also influence the phenotypic consequences 
of inherited mutations, but examples of molecular variation have only 
been described that predict phenotypic variation at the level of other 
molecules’ or cellular phenotypes”. 

To identify determinants of mutation outcome, we present a model 
for incomplete penetrance based on genetic interaction networks*® 
(Fig. 1b). Our model proposes that in the absence of additional genetic 
variation, it is stochastic variation in the abundance or activity of 
genetic interaction partners (genes that influence the outcome of a 
mutation when genetically altered) that determines the outcome of a 
mutation (Fig. 1d). The correlated loss of genetic, environmental and 
stochastic robustness upon gene deletion in yeast is highly consistent 
with the generality of this proposal®”. 

To test our model, we first used a null mutation in the T-box tran- 
scription factor gene tbx-9 (Supplementary Fig. 1) that causes an 
incompletely penetrant defect in C. elegans larval morphology owing 
to abnormal development of the epidermis and muscle'®”” (Fig. 1a). 
Transcription factor tbx-9 is related to another transcription factor, 
tbx-8, by an ancestral gene duplication; inactivation of tbx-8 causes 


similar incompletely penetrant defects, and loss of both genes results in 
synthetic lethality in C. elegans'®” and Caenorhabditis briggsae’*. 

If the incomplete penetrance of tbx-9 relates to variation in the 
expression of tbx-8, then increased expression of txb-8 should be able 
to compensate for the loss of tbx-9. Expression of TBX-8 from a 
transgene indeed reduces the penetrance of a thx-9(0k2473) null allele 
(Fig. 1c and Supplementary Fig. 2). A further requisite of our model is 
the existence of inter-individual variation in the expression of the 
genetic interaction partner. We quantified the induction of a fluor- 
escent reporter driven by the thx-8 promoter (ptbx-8::GFP) during 
normal development and found that there was substantial variation 
among embryos (Fig. 2a) with an important extrinsic component 
(Supplementary Figs 3a and 4). 

We next quantified the induction of the tbx-8 reporter in thx- 
9(0k2473) mutant animals and found that expression was increased 
(Fig. 2b, 1.2-fold upregulation at comma stage, P = 1.6 X 10 *). We 
confirmed this upregulation by performing single molecule fluor- 
escence in situ hybridization against the endogenous thx-8 messenger 
RNA (mRNA) (Supplementary Fig. 5, 1.6-fold upregulation in 40- to 
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Figure 1 | Genetic interactions provide a general model for incomplete 
penetrance. a, Inactivation of the gene tbx-9 in C. elegans results in an 
incompletely penetrant defect, with approximately half of embryos hatching 
with abnormal morphology (small arrow). b, Representation of a negative 
(synergistic) genetic interaction between two genes A and B. c, Increased 
expression of TBX-8 reduces the penetrance of a thx-9(0k2473) null mutation 
(49% (n = 138) compared with 34% (n = 139), a 33% decrease in abnormal 
phenotypes, P = 0.011, Fisher’s exact test). Likewise, increased TBX-9 
expression rescues the penetrance of a tbx-8(0k656) null mutation (60% 

(n = 164) compared with 35% (n = 195), a 42% decrease, P = 2.8 X 10°°).dA 
model for incomplete penetrance based on variation in the activity of genetic 
interaction partners. 
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Figure 2 | Early inter-individual variation in the induction of ancestral gene 
duplicates predicts the outcome of inherited mutations. a, Quantification of 
total green fluorescent protein (GFP) expression from a thx-8 reporter during 
embryonic development in WT (black) and thx-9(0k2473) (green) individuals. 
Each individual is a separate line. a.u., Arbitrary units. b, Boxplot of thx-8 
reporter expression (a) showing 1.2-fold upregulation in a tbx-9 mutant at 
comma stage (~290 min, P = 1.6 X 10°, Wilcoxon rank test). c, Expression of 
tbx-8 reporter in a thx-9(0k2473) background for embryos that hatch with (red) 
or without (blue, WT) a morphological defect. d, Boxplot of ¢ showing thx-8 
expression is higher in thx-9 embryos that develop a WT phenotype (blue) 
compared with those that develop an abnormal (red) phenotype at comma 
stage (P= 6.1 X 10 3).e, Expression of a ptbx-9::GFP reporter in WT (black) 
and tbx-8(0k656) mutant (green). f, Boxplot of tbx-9 reporter showing 4.3-fold 


50-cell-stage embryos, P = 5.3 X 10 *). Thus, a direct or indirect feed- 
back mechanism exists that upregulates transcription of tbx-8 when its 
ancestral paralogue is inactivated. This feedback also acts on the tbx-9 
promoter, which is also upregulated in a thx-9(0k2473) mutant back- 
ground (Supplementary Fig. 6). This is highly consistent with observa- 
tions that have been made in yeast’’”° and suggests that compensatory 
expression by negative feedback regulation of gene duplicates is 
probably a conserved phenomenon across species. 

Next, we tested whether variation in the induction of tbx-8 corre- 
lated with the outcome of the tbx-9(0k2473) mutation at hatching. We 
retrospectively compared the early expression of the reporter in 
embryos that did, and did not, hatch with an abnormal pheno- 
type. Early expression was higher in the second class (Fig. 2c, d, 
P=6.1X 10 °);whereas 80% of the embryos with reporter expression 
in the highest quartile hatched without a defect, only 40% of those in 
the lowest quartile showed no phenotype (Supplementary Table 1, 
P=3.3%X10 °). In contrast, early expression from a ubiquitously 
transcribed promoter (plet-858::GFP) did not predict phenotypic 
outcome in individuals (Supplementary Fig. 7a, b, P = 0.58), nor did 
variation in the expression of a reporter for the transcription factor 
ELT-5, another protein required for epidermal development” 
(Supplementary Fig. 7c, d, P = 0.97). High expression of the synthetic 
lethal partner tbx-8, but not high transcription in general, is therefore 
partly epistatic”* to the loss of tbx-9. We also tested the reverse situation: 
that is, whether variation in tbx-9 expression compensation (Fig. 2e, f, 


L1 stage phenotype 


upregulation at comma stage (~375 min, P = 3.6 X 10° '°). g, Expression of 
tbx-9 reporter in a tbx-8(0k656) mutant background, colour code as in 

c. h, Boxplot of g showing tbx-9 expression is higher in thx-8 embryos that 
develop a WT phenotype (P = 0.033). i, Expression of a pflh-2::GFP reporter in 
WT (black) and flh-1(bc374) mutant (green). j, Boxplot of flh-2 reporter 
expression (i) showing 1.8-fold upregulation in a flh-1 mutant at comma stage 
(~180 min, P = 2.2 X 107°). k, Bright-field and fluorescence image of an 
approximate 100-cell flh-1; pflh-2::GFP embryo. Red arrow indicates the local 
expression of flh-2 reporter quantified for flh-1 phenotypic prediction. 

1, Boxplot showing higher flh-2 reporter expression at approximate 100 cells for 
WT (blue) compared with abnormal (red) phenotypes (P = 0.014). Boxplots 
show the median, quartiles, maximum and minimum expression in each 
data set. 


4.3-fold upregulation, P= 3.6 x 10 '°) predicted the phenotypic 
outcome of a thx-8(0k656) mutation. We found that was indeed the 
case (Fig. 2g, h, P = 0.033, see Supplementary Table 1). 

The FLYWCH transcription factors flh-1 and flh-2 are an additional 
pair of ancestral gene duplicates that redundantly repress embryonic 
expression of the microRNAs lin-4, mir-48 and mir-241 (ref. 23). 
Similar to the case for tbx-9 null mutants, flh-1(bc374) mutant embryos 
induced higher levels of a pflh-2::GFP reporter than WT embryos 
(Fig. 2i, j, 1.8-fold at comma stage, P< 2.2 x 10 1°). In contrast, we 
found neither upregulation of a tbx-8 reporter in an flh-1(bc374) back- 
ground, nor upregulation of an flh-2 reporter in a thx-9(0k2473) 
mutant (Supplementary Fig. 8, P= 0.74 and P= 0.55 respectively). 
filh-1(bc374) embryos that expressed higher flh-2 reporter levels early 
in development (Fig. 2k, see Methods) were, however, more likely to 
develop into morphologically WT larvae (Fig. 21, P= 0.014, see 
Supplementary Table 1). In contrast, we found that pflh-2::GFP 
reporter levels did not predict the outcome of a tbx-9(0k2473) null 
mutation (Supplementary Fig. 7g, h, P = 0.32), nor did induction of 
the ptbx-8::GFP reporter predict the outcome of the flh- 1(bc374) muta- 
tion (Supplementary Fig. 7e, f, P = 0.49). 

Many ancestral gene duplicates have retained partly redundant func- 
tions over extensive evolutionary periods'’****. One explanation for 
this could be the selection pressure provided by stochastic develop- 
mental errors’”® (‘canalization””’). Our results provide direct empirical 
support for this hypothesis, showing that when one member of a partly 
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redundant gene pair is inactivated, expression variation in the other 
gene becomes an important influence on the phenotype of an 
individual. 

The variation in the induction tbx-8 that we can quantify only partly 
accounts for variation in the outcome of the tbx-9 mutation (Fig. 2d). 
We considered that one additional influence on mutation outcome 
could be variation in the activity of general buffering systems such as 
molecular chaperones. In multiple eukaryotes, chaperone inhibition can 
enhance the effects of diverse mutations*””* and increased chaperone 
activity promiscuously suppresses detrimental mutations in bacteria”””. 

We constructed a transcriptional reporter for the constitutively 
expressed chaperone daf-21 (Hsp90). Variability in the pdaf- 
21::mCherry reporter was evident during all larval stages and, 
surprisingly, its induction during embryonic development also varied 
substantially, even in the absence of a mutation or environmental 
perturbation (Supplementary Figs 9a and 10). No difference in trans- 
gene copy number was detected between animals with high or low 
daf-21 reporter expression (Supplementary Fig. 11), and individuals 
with high reporter expression have high expression of the DAF-21 
protein (Fig. 3a and Supplementary Fig. 12). 
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Figure 3 | Inter-individual variation in chaperone induction predicts the 
outcome ofa mutation. a, pdaf-21::mCherry; glp-1(e2141) animals were sorted 
into ‘high’ and ‘low’ groups based on their pdaf-21::mCherry expression 24h 
after the L1 stage at 25 °C. Western blot analysis confirmed that worms 
belonging to the ‘high’ group had higher levels of endogenous DAF-21 protein. 
As a positive control, mCherry protein levels showed the same trend. B-Actin 
was used as loading control; results of three replica sortings are shown. 

b, c, Correlation in individuals (measured at comma stage) between: pdaf- 
21::mCherry and pdaf-21::GFP (Pearson correlation coefficient, r = 0.77, 
P=3.4X 10 °) (b)s pdaf-21::mCherry and phsp-4::GFP (r = 0.63, 

P=2.8X10 “)(c).de, Expression of daf-21 reporter in a thx-9(0k2473) 
mutant background. Embryos that hatch into phenotypically WT worms (blue) 
have higher expression than those hatching with a morphological defect (red) at 
the comma stage (P = 1.9 X 10°). 
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Wetested whether this variability was particular to daf-21 or common 
to other chaperones using a reporter for the gene hsp-4 (orthologous to 
mammalian BiP). Transcription from the hsp-4 promoter also varied 
extensively among embryos (Supplementary Fig. 13) and was correlated 
with that from the daf-21 promoter (Fig. 3c, Pearson correlation coef- 
ficient, r= 0.63, P= 2.8 X 10°), similar to the correlation between 
two independent daf-21 reporters (Fig. 3b, r = 0.77, P = 3.4 X 10 *). 
This and additional correlations between stress response genes 
(Supplementary Fig. 14) suggest that, during normal development, 
isogenic embryos differ in a coordinated manner in the transcriptional 
induction of multiple chaperones. 

We quantified induction of a pdaf-21::mCherry reporter in a tbx- 
9(0k2473) background. The mean induction of this reporter was 
increased in thx-9(0k2473) animals (Supplementary Fig. 9b, 1.8-fold 
upregulation, P = 6.6 x 10 *). In addition, embryos that later hatched 
without a morphological defect expressed higher levels of the reporter 
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Figure 4 | Simultaneous quantification of inter-individual variation in two 
buffering systems accurately predicts the outcome of a mutation. 

a, Expression levels of thx-8 and daf-21 reporters in individual tbx-9(0k2473) 
mutant embryos at the comma stage of development are not correlated 

(r = 0.061, P = 0.56). Each point represents one individual embryo. Embryos 
that hatched into phenotypically WT larvae are blue; embryos that hatched 
with a morphological defect are red. The median values for each reporter are 
indicated (black lines); embryos with expression above or below the median are 
defined as high (H) and low (L), respectively. b, Percentage of embryos from 
each expression class hatching with normal morphology. Representative 
bright-field, green and red fluorescence images are shown. Prediction scores for 
different time points and measures are given in Supplementary Table 3. 
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early on in development (Fig. 3d, e, P= 1.9 x 10 >, see Supplementary 
Table 1). The daf-21 reporter also predicted thx-8(0k656) mutation 
outcome (Supplementary Figs 15 and 16). Higher chaperone expres- 
sion during early embryonic development therefore predicts a reduced 
effect of the inherited mutation. 

We next constructed a strain allowing us to quantify simultaneously 
inter-individual variation in the expression of both buffering systems. 
Expression from the tbx-8 and daf-21 reporters does not correlate 
across individuals (Fig. 4a, r= 0.061, P=0.56), showing that the 
two buffering systems vary independently. We retrospectively divided 
tbx-9(0k2473) mutation embryos into four approximately equally 
populated groups, depending upon whether their expression was 
above or below the median for each of the reporters, and examined 
the proportion of individuals hatching as WT larvae in each group 
(Fig. 4b). Whereas 30% of embryos with below-median expression of 
both reporters hatched without an abnormal phenotype (‘LL’ embryos, 
Fig. 4a, b, P=1.7X 10 °), 68% and 63% of embryos with above- 
median expression of a single reporter (daf-21 or tbx-8, respectively) 
showed no morphological defect (Fig. 4b). Strikingly, 92% of embryos 
with above median expression of both reporters hatched as pheno- 
typically WT larvae (“HH’ embryos, Fig. 4a, b). That is, non-genetic 
variation in two buffering systems can be almost completely epistatic” 
to the tbx-9 mutation. Receiver operating characteristic curve analysis 
confirmed the independent contribution of both reporters to pheno- 
typic predictions (Supplementary Fig. 17). 

In summary, we have shown here that incomplete penetrance is not 
just a direct consequence of failure caused by mutations’*", but that 
organisms have plastic compensatory responses’ in both specific 
and more promiscuous genetic interaction partners that vary among 
individuals. It is the combination of this variation that determines the 
outcome of each mutation in each individual. 

Inherited mutations also frequently have variable consequences in 
other species, including in human disease’. Based on our findings in 
C. elegans, we propose that incomplete genetic compensation may 
play a role in human disease, influencing the outcome of inherited 
polymorphisms in each individual. 


METHODS SUMMARY 


The expression from individual developing embryos was quantified using 
customized imaging and analysis protocols. Embryos were released from adult 
worms by dissection and sorted according to their developmental stage. 
Phenotypes were scored at hatching. Reporter constructs were generated by micro- 
injection or bombardment. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Expression profiling: image acquisition. Synchronous populations of tbx- 
9(0k2473) animals were obtained by treating worms with sodium hypochlorite 
and then allowing embryos to develop at 25°C. For a typical time-lapse, around 
30-50 gravid worms were picked and then washed three times with M9 buffer in a 
watch glass. Worms were then transferred to a 50 mm glass-bottomed culture dish 
(MatTek Corporation). Adults were dissected and released embryos were manu- 
ally sorted. In a period of time not exceeding 15 min, 4- to 12-cell-stage embryos 
were collected and carefully placed with an eyelash-pick into the centre of the dish; 
the rest of embryos and debris were removed using a pipette. A cover slip was 
placed on top of the worms and sealed with paraffin. Fluorescence and bright-field 
images in multiple positions were acquired with a X10 Plan- NEOFLUAR 0.3 NA 
objective on a Zeiss Cell Observer HS system consisting of an inverted microscope 
(AxioObserver.Z1) equipped with an automated stage, a C9100-13 IMAG-EM 
Dual Mode EM-CCD camera (Hamamatsu) and a Sutter DG-4 fast switching 
xenon light source (Sutter Instrument Company). Temperature control was 
achieved using a custom microscope incubator box and temperature control 
device developed by the European Molecular Biology Laboratory (http://www. 
embl-em.de) coupled to a Unichiller cooling unit (Huber Kaltemaschinenbau) 
for temperature control below room temperature. 

At the start of imaging, approximately 25-30 min had elapsed since the dissec- 
tion of worms. During the first 20 min of the time-lapse no fluorescence illumina- 
tion was applied, to avoid phototoxicity as early embryos are very sensitive to 
illumination. Subsequently bright-field and fluorescence images were obtained 
every 10min for 8h, followed by just bright-field images collected every 2 min 
for 4 more hours or until most of the worms had hatched. The increase in bright- 
field frequency facilitated the phenotypic score of hatching worms. Exposure times 
and frequency were adjusted to maximize the signal-to-noise ratio while avoiding 
phototoxicity (evaluated as approximately 100% hatching of WT (N2) embryos). 

The imaging conditions for the daf-21, let-858, elt-5 and flh-2 reporters ina tbx-9 
mutant background were exactly the same as for the thx-8 reporter. In the other 
genetic backgrounds exposure times and acquisition characteristics were adjusted 
according to the characteristics of the reporter construct. The tbx-9 reporter was 
expressed earlier than the tbx-8 reporter and at lower levels, and the exposure time 
was increased 2.5X to compensate for this, while reducing the frequency of 
acquisition to once every 25 min. The flh-2 reporter was also expressed at lower 
levels, but the induction was later in development. For this reporter the exposure 
time was increased (2.5 compared with that for the thx-8 reporter), fluorescence 
acquisition was initiated 1 h later than bright field, but images were still obtained 
once every 10 min. 

When comparing the expression of a particular reporter in two different genetic 
backgrounds, embryo isolation was performed in 10 min, one strain immediately 
after the other (total time of 20 min), and image acquisition was performed simul- 
taneously using the conditions previously established for each strain. 
Expression profiling: quantification of gene expression. Total fluorescence 
from each embryo was quantified using Image)” (http://rsb.info.nih.gov/ij/). 
First, we performed a global background correction of all images by subtracting 
an image taken in a region of no fluorescence for each time point. For the pelt- 
5::mCherry reporter data set, an additional flat field correction step was necessary. 
Background-subtracted images were divided by the homogeneous fluorescence 
reference to correct for non-homogeneous illumination, and the resulting images 
used for quantification**. The homogeneous fluorescent reference was created by 
imaging a drop of 0.05 mg ml Nile Red solution in M9 buffer. All embryos were 
visually identified and tagged so that no embryo was counted twice. A region of 
interest (ROI) was manually drawn around each embryo and an equally sized ROI 
drawn in the background vicinity; finally, the total fluorescence from each embryo 
was calculated as the integrated density difference between the embryo containing 
ROJ and the local background ROL For the particular case of the flh-2 reporter, at the 
100-cell stage most of the reporter signal came from a very restricted area of the 
embryo. Because embryo background fluorescence was masking this specific source 
of expression, we quantified it by first drawing an equally sized circle-shaped ROI in 
all embryos around the signal source and measuring the integrated density. This 
quantified expression was used for flh-1 phenotypic prediction. 

All embryos were staged using three developmental landmarks: (1) number of 
cells at beginning of time-lapse, (2) elapsed time until comma stage and (3) elapsed 
time until twofold-stage. Only those embryos starting the time-lapse at the 
approximately 6- to 15-cell stage and reaching the next development landmarks 
with a delay of less than or equal to 30 min were considered for further analysis. In 
a typical time-lapse, around 10-15 embryos fulfilled those requirements. Data for 
each reporter-mutant pair prediction included embryos from at least four inde- 
pendent experiments. To correct for day-to-day technical variation (Supplemen- 
tary Fig. 18), the expression of each embryo was normalized using the mean 
expression of the population (or populations when comparing the expression of 


different genotypes) for each time-lapse. For visualization, five time-point 
smoothing was used for 10min acquisition frequency recordings, and three 
time-point smoothing used when the frequency was 25 min. 

Data analysis, statistics and figures were generated using R (http://www.r-project. 

org). Receiver operating characteristic curves for dual reporter predictions were 
constructed using the ROCR package*’. Expression levels of thx-8 and daf-21 at 
comma stage were normalized by subtracting the mean and dividing by the standard 
deviation of each population. The sum of the normalized values for each embryo was 
used to calculate the receiver operating characteristic curve of the joint contribution. 
Phenotypes. The phenotype of each embryo was scored as WT or abnormal by 
visual inspection of the hatching morphology. Any abnormal morphology was 
defined as non-WT. To study the prediction of mutation expressivity, tbx- 
9(0k2473) animals that hatched with an abnormal phenotype were further classified 
into two classes. ‘Mild’ abnormality indicated individuals with a characteristic 
posterior depression but that conserve the general body structure. ‘Severe’ embryos 
were those with severe malformations along most of the posterior axis or the entire 
body (Supplementary Fig. 19). 
Nematode strains and growth conditions. Worms were grown at 20 °C on NGM 
plates using Escherichia coli OP50 as a food source unless otherwise noted. The 
strains used in and constructed for this study are listed in Supplementary Table 2. 
Wild type was Bristol N2 (ref. 34). Reporter strains ptbx-8::GFP (UL2520) and pelt- 
5::GFP (SD1434) were provided by I. Hope and S. Kim, respectively. The strain 
VT1343 (flh-1(bc374)) was a gift from V. Ambros. Some deletion mutations used 
in this work were provided by the C. elegans Gene Knockout Consortium. 

We optimized worm growth and embryo imaging conditions for each mutant, 
to obtain a penetrance with a reasonable representation of embryos hatching with 
both WT and morphologically abnormal phenotypes. The tbx-9(0k2473) mutants 
were maintained at 25 °C and embryos were allowed to develop at 27.5 °C to givea 
penetrance of approximately 50%; thx-8(0k656) were grown at 16 °C and embryos 
allowed to develop at 20°C; flh-1(bc374) mutants were grown at 20°C and 
embryos allowed to develop at 26 °C. For flh-2(bc375) mutants, a growth condition 
was not found where the penetrance was high enough to be usefully studied. 
Reporter gene constructs. We generated transcriptional reporters for the genes 
tbx-9, flh-2 and daf-21. Promoter regions of these genes were amplified from N2 
genomic DNA by PCR using the following primer pairs: FW-ptbx-9, 
5'-TTGGGTTCAGATAACAATTTGG-3’; RV-ptbx-9, 5’-ATTTTTTGTCTGA 
AAACGTTAAAATA-3'; FW-pflh-2, 5’-GCGCTTCTCGTGGGCTCT-3’; RV- 
pflh-2, 5’-ATACAGGCGGTCTGAAAA-3’; FW-pdaf-21, 5'-CGAAACGGTCG 
AATTTCATAA-3'; RV-pdaf-21, 5’-ATGGTTCTGGAAAAATATCAATTA-3’. 
The lengths of the amplified regions were 1.5 kb, 1.5 kb and 1.9kb, respectively. 
The promoters were cloned into the MultiSite Gateway Entry vector pENTR 
5'-TOPO following the manufacturer’s instructions (Invitrogen) and then trans- 
ferred by three-fragment MultiSite Gateway Pro LR reaction (Invitrogen) into the 
pCFJ150-pDESTttTi5605[R4-R3] destination vector. Middle Entry vectors used 
were pENTRwGFP or pENTRmCherry; 3’ Entry vector was pCM5.37 containing 
the unc-54 3’ untranslated region (UTR). Final constructs were confirmed by PCR 
amplification and sequencing. 

Strain construction. Strains carrying an extrachromosomal array expressing ptbx- 
8: TBX-8::unc-54 3’ UTR or ptbx-9::TBX-9::unc-54 3' UTR were generated by 
microinjection as previously described** using pmyo-2::mCherry as a co-injection 
marker. The transcriptional reporter ptbx-9::GFP was generated by microinjection 
using pmyo-2::mCherry as a co-injection marker, followed by integration using the 
ultraviolet irradiation method**. The transcriptional reporters pdaf-21::mCherry, 
pdaf-21::GFP, phsp-4::mCherry, ptbx-8::mCherry and pflh-2::GFP were generated by 
bombardment in an unc-119(ed3) background’””*. All transgenic strains were back- 
crossed at least four times. No transgenic strains displayed abnormal phenotypes. 
Quantitative PCR. Worms carrying the pdaf-21::mCherry transgene were syn- 
chronized by hypochlorite treatment, plated as L1 larvae and allowed to develop 
for 24h at 25°C. Larvae were then sorted by hand using a MVX10 Macro Zoom 
fluorescence microscope (Olympus) into ‘high’ and ‘low’ groups based on their 
mCherry fluorescence levels. Each group had approximately 20 larvae and sorting 
was performed in triplicate. Worms were washed in M9 and transferred to 1.5 ml 
Eppendorf tubes containing 200 ul of lysis Buffer (50 mM KCI, 10 mM Tris-HCl 
(pH 8), 2.5 nM MgCh, 0.45% Nonidet P-40, 0.45% Tween20) supplemented with 
20 ug proteinase K and 201g RNase A. Tubes were frozen in liquid nitrogen, 
followed by incubation at 65°C for 1h and 95°C for 30min. Genomic DNA 
was extracted by adding 500.1 Phenol:Chloroform:Isoamyl Alcohol 25:24:1 
(Sigma) and precipitated using 600 ul of ice cold ethanol and 50,1 of 3M 
NaAc. Quantitative PCR was performed on a LightCycler 480 machine with 
SYBR green detection (Roche). The transgene was amplified with primers target- 
ing the mCherry coding region: FW-mCherry, 5'-CTACGACGCTGAGGTC 
AAGA-3'; RV-mCherrry, 5’-CGATGGTGTAGTCCTCGTTG-3’. Transgene 
levels were normalized by the mean level of two other loci in the genome using 


©2011 Macmillan Publishers Limited. All rights reserved 


the following primers: FW-daf-21ORF, 5’-TCCAATGACTGGGAAGATCA-3’; 
RV-daf-21ORF, 5'-CGAACGTAGAGCTTGATGGA-3’; FW-csq-1, 5’-AACTG 
AGGTTCTGACCGAGAAG-3'; RW-csq-1, 5’-TACTGGTCAAGCTCTGAGT 
CGTC-3’. 

Western blotting. To test whether expression of the pdaf-21::mCherry predicted 
differences in the levels of DAF-21 protein expressed from the endogenous gene, 
pdaf-21::mCherry; glp-1(e2141) animals were synchronized as L1 larvae by hypo- 
chlorite treatment and starvation at 20°C overnight. Worms were transferred to 
plates and allowed to develop for 24h at 25 °C (glp-1(e2141) mutation restrictive 
temperature; this mutation prevents the development of the germ line; the pdaf- 
21::mCherry reporter does not express in germline cells). Larvae were then sorted 
using an MVX10 Macro Zoom fluorescence microscope (Olympus) into ‘high’ 
and ‘low’ groups based on their mCherry fluorescence. Each group had approxi- 
mately 30 larvae and sorting was performed in triplicate. Worms were washed in 
M9 buffer, frozen in liquid nitrogen, dissolved in 30 ul of SDS loading buffer, 
sonicated for 5min at 4°C and then boiled at 95°C for 5min. Samples were 
loaded into a 12.5% SDS polyacrylamide gel and transferred to a nitrocellulose 
membrane. Primary antibodies were incubated overnight at 4 °C. The following 
antibodies and dilutions were used: rabbit polyclonal anti-B-actin (ab8227) 
(Abcam) 1:1,000; rabbit polyclonal anti-DsRed (ClonTech) 1:1,000; mouse 
monoclonal anti-DAF-21 608F hybridoma supernatant” (a gift from J. Miwa 
and Y. Yamaguchi) 1:40; anti-rabbit IgG peroxidase conjugate (Jackson 
ImmunoResearch) 1:40,000 and anti-mouse IgG peroxidase conjugate (Sigma- 
Aldrich) 1:8,000. Quantification of DAF-21 levels in N2 WT and daf-21(+/—) 
was performed in a similar fashion; 30 L4 larvae of each genotype were picked in 
triplicate. We detected peroxidase activity with SuperSignal West Femto Substrate 
(Thermo Scientific) and blot imaging was performed using a Fujifilm LAS-3000 
luminescent image analyser with exposures times in the lineal range of detection. 
Finally, band intensities were quantified using Image]. 

Single molecule fluorescence in situ hybridization. We followed the protocol of 
Raj et al.*° with the following modifications. Probes against thx-8 mRNA were 
generated with the online tool available at www.singlemoleculefish.com and syn- 
thesized by Biosearch Technologies. Probes were coupled to CAL Fluor Red 590 
fluorophore. The formamide concentration in hybridization and wash buffer was 
20% and the total concentration of pooled probes was 25 nM. Hybridization was 
performed overnight at 30 °C. We imaged embryos using an oil immersion < 100 
objective on a Leica DMI4000 inverted microscope equipped with an Evolve 512 
EMCCD camera (Photometrics) and a Lumen 200 metal arc lamp (Prior 
Scientific). We collected stacks of typically 25 images spaced by 0.3 um for each 
embryo. Stacks were processed applying a Laplacian of Gaussian filter with the 
LoG3D plugin*’ for ImageJ. Then dimensionality was reduced by applying a 
Z-maximum intensity projection (the final image contains the maximum value 
over all images in the stack at the particular pixel location), and a single threshold 
was applied to each embryo by visual inspection as recommended. The total 
number of particles was counted with the ‘Analyze particles’ ImageJ tool. 
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Embryos were staged by counting the total number of nuclei stained with 
4',6-diamidino-2-phenylindole (DAPI). 

Rescue experiments. Worms mutant for thx-9 were crossed with a line carrying 
an extrachromosomal array overexpressing TBX-8 protein and a pmyo- 
2:mCherry reporter as a marker. Adult gravid worms growing at 25°C were 
manually sorted under an MVX10 Macro Zoom fluorescence microscope 
(Olympus) into two groups, one carrying the fluorescent marker, the other lacking 
it. This latter group served as an internal negative control. Extrachromosomal 
array transmission was around approximately 50%, meaning that not all embryos 
in the overexpression group actually carried the transgene, so rescue scores are 
probably an underestimation. Four- to 12-cell stage embryos were collected in 
50 mm glass-bottomed dishes following the same protocol as for time-lapses, and 
incubated at 27.5 °C. Phenotypes of worms were scored 15h later. In the case of 
tbx-8 mutation rescue, worms were grown at 16°C and incubated at 20°C. 
Control experiments were performed crossing the tbx-9(0k2473) and tbx- 
8(0k656) mutations to a strain carrying the extrachromosomal array 
wwEx26|pmir-42-44::GFP+ unc-119(+)]. This transcriptional reporter is 
expressed at high levels during early and late development, and its transmission 
is approximately 95%. Rescue experiments were performed at least in triplicate for 
each pair. Finally, repeats were pooled and significance was tested using Fisher’s 
exact test. 
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Mapping intact protein isoforms in discovery mode 
using top-down proteomics 
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A full description of the human proteome relies on the challenging 
task of detecting mature and changing forms of protein molecules 
in the body. Large-scale proteome analysis’ has routinely involved 
digesting intact proteins followed by inferred protein identifica- 
tion using mass spectrometry’. This ‘bottom-up’ process affords a 
high number of identifications (not always unique to a single 
gene). However, complications arise from incomplete or ambigu- 
ous’ characterization of alternative splice forms, diverse modifica- 
tions (for example, acetylation and methylation) and endogenous 
protein cleavages, especially when combinations of these create 
complex patterns of intact protein isoforms and species’. “Top- 
down’ interrogation of whole proteins can overcome these 
problems for individual proteins**, but has not been achieved on 
a proteome scale owing to the lack of intact protein fractionation 
methods that are well integrated with tandem mass spectrometry. 
Here we show, using a new four-dimensional separation system, 
identification of 1,043 gene products from human cells that are 
dispersed into more than 3,000 protein species created by post- 
translational modification (PTM), RNA splicing and proteolysis. 
The overall system produced greater than 20-fold increases in both 
separation power and proteome coverage, enabling the identifica- 
tion of proteins up to 105 kDa and those with up to 11 transmem- 
brane helices. Many previously undetected isoforms of endogenous 
human proteins were mapped, including changes in multiply 
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Figure 1 | The four-dimensional platform for high-resolution fractionation 
of protein molecules. Schematics (top) and photographs (middle) are shown 
for (a) a custom device for sIEF, (b) a custom device for multiplexed GELFrEE 
and (c) RPLC coupled to mass spectrometry. Representative one-dimensional 
gels of fractions collected from the two electrophoretic devices are shown below 
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modified species in response to accelerated cellular ageing 
(senescence) induced by DNA damage. Integrated with the latest 
version of the Swiss-Prot database’®, the data provide precise corre- 
lations to individual genes and proof-of-concept for large-scale 
interrogation of whole protein molecules. The technology pro- 
mises to improve the link between proteomics data and complex 
phenotypes in basic biology and disease research’. 

Effective fractionation* ”’ is critical for sample handling before mass- 
spectrometry-based proteomics. So far, no fractionation procedure for 
intact proteins can match the resolution of two-dimensional gel elec- 
trophoresis (two-dimensional gels). Here we use a liquid phase alterna- 
tive to two-dimensional gels that bypasses both their low recovery and 
extensive workup steps before mass spectrometry". This procedure for 
two-dimensional liquid electrophoresis'* comprises solution isoelectric 
focusing (sIEF) followed by gel-eluted liquid fraction entrapment elec- 
trophoresis (GELFrEE)"* for fractionation by protein isoelectric point 
and size, respectively (Fig. 1a, b). Combining these with nanocapillary 
liquid chromatography and mass spectrometry (LC-MS) (Fig. 1c) for 
both low“ and high molecular mass proteins’* results in an overall four- 
dimensional separation of whole protein molecules before ion frag- 
mentation by tandem mass spectrometry and protein identification. 

Using the four-dimensional platform described above, we generated 
a quasi-two-dimensional gel perspective of the human proteome with 
extremely high molecular detail (Fig. 2a) from individual replicate 
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their pictures; note the resolution attainable at the level of intact proteins. The 
combined resolution of RPLC with Fourier-transform mass spectrometry is 
depicted by the chromatogram along with selected isotopic distributions for 
protein ions measured during the run. 
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Figure 2 | Two visual representations of proteome-scale runs. a, The heat 
map is generated from combined four-dimensional runs of nuclear and 
cytosolic extracts. Intact mass and isoelectric point values are indicated on the 
y- and x-axes, respectively. Each box in the grid displays total ion 
chromatograms from LC runs of two-dimensional liquid electrophoresis 
fractions plotted as time versus neutral intact mass. The mass spectrometry 
intensity is indicated by colour (legend on top left). Representative precursor 


analyses of nuclear and cytosolic extracts of HeLa S3 cells (Supplemen- 
tary Fig. 1). In discovery mode, the IEF~-GELFrEE-nanocapillary 
liquid chromatography platform used 0.5-1 mg of input protein and 
provided a peak capacity of well over 2,000 for separation of protein 
molecules in solution. Considering the separation power of the mass 
spectrometer, the peak capacity of the four-dimensional system is 
greater than 100,000 for proteins below approximately 25 kDa 
(Supplementary Information). This is 20-fold higher than the peak 
capacity for high-resolution two-dimensional gels (less than 5,000). 
Identification and characterization of isoforms were achieved using 
fragmentation data acquired with less than 10 part-per-million mass 
accuracy for searching databases with highly annotated primary 
sequences'®. Using tailored software’’, we overcame the ‘protein infer- 
ence problem’ where identification ambiguity results when isoforms 
(for example, from members of a gene family or alternative splicing) 
produce many identical tryptic peptides*'’. The databases and search 
engine used here are fully compatible with the UniProt flat file format 
and enable a deep consideration of known PTMs, alternative splice 
variants, polymorphisms, endogenous proteolysis and diverse combi- 
nations of all these sources of molecular variation at the protein level’®. 
Together with the careful curation of the Swiss-Prot database’®, the 
result is an informatics framework that maps each given protein iden- 
tification to a single gene (except in rare cases like ubiquitin, where 
multiple genes can produce the identical sequence). Extended details 
on statistical analysis are provided in the Methods section. 

A total of 1,043 proteins were identified with unique Swiss-Prot 
accession numbers in this study (Supplementary Table 1). These iden- 
tifications originate from 1,045 human genes, 77% of whose protein 
products displayed amino (N)-terminal acetylation. The distribution of 
q values, which indicates the confidence of protein identifications (see 
Methods), is shown in Fig. 3c. This level of proteome coverage repre- 
sents the most comprehensive implementation of top-down mass 
spectrometry so far, with an approximate tenfold increase in identifica- 
tions of intact proteins for any microbial system’? and a greater than 
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scans were extracted from the heat map for electrospray ionization—mass 
spectrometry spectra of high (1), medium (2) and low (3) mass proteins, along 
with their identifications from online fragmentation (insets at bottom). b, Plot 
created from selective display of protein pairs with mass differences consistent 
with acetylation (yellow), phosphorylation (red) and methylation (green), with 
three and two protein species shown as examples in insets (4) and (5), 
respectively. 


20-fold increase over any previous work in mammalian cells'*”’ (Fig. 3a). 
In addition, fragmentation evidence for 3,093 protein isoforms/species 
was captured in this initial report (Supplementary Table 1), with PTMs 
detected as follows: 645 phosphorylations, 538 lysine acetylations, 158 
methylations, 19 lipid/terpenes and 5 hypusines. Over 400 species were 
attributed to core histones alone. Comparisons of predicted protein 
hydrophobicity and isoelectric point showed minimal bias compared 
with that expected for the human proteome (Supplementary Fig. 2). 

Using an orthogonal method to detect PTMs based on intact mass 
values’’, we detected pairs of protein species showing characteristic 
mass differences (Fig. 2b). For proteins less than 20 kDa, 225 pairs 
showed mass differences consistent within 0.05Da with mono- 
methylation, 185 with di-methylation and 122 with tri-methylation/ 
acetylation. Other mass differences revealed 87 cases consistent with 
double acetylation, 140 with mono-phosphorylation and 100 with di- 
phosphorylation events (Fig. 2b). Using this set of mass differences on 
the entire HeLa data set for all isotopically resolved proteins, a total of 
2,130 such mass shifts were found. 

Complete characterization of a protein requires the theoretical and 
experimental mass values to match within error. For the 1,043 proteins 
identified, 431 and 331 were identified with intact mass information 
from either isotope spacings or deconvolution of charge states, respect- 
ively. Of these data, 54% of the isotopically resolved proteins matched 
the species identified from the database within 2 Da (Supplementary 
Fig. 3a). Likewise, 130 of 331 of the masses determined by deconvolu- 
tion were manually determined to be of high quality and 51% of these 
matched within 200 Da (Supplementary Fig. 3b). The protein species 
outside these windows are clearly identified by fragmentation, but 
harbour unexplained mass discrepancies (Am) at this time. The com- 
plete explanation of Am in the human proteome motivates future 
refinements in data acquisition to obtain enough tandem mass spec- 
trometry information on all the protein isoforms/species. 

Major functional differences can exist among protein isoforms in 
a family, making their precise identification a major boost in the 
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Figure 3 | Proteome analysis metrics associated with this study. a, Graph 

showing the striking increase in identifications from previous studies achieved 
in archaeal, bacterial, yeast or human systems. References cited are indicated 
above each bar. b, A gene ontology analysis for the identifications in this study. 


information content of proteomic analyses in higher eukaryotes. An 
intact protein mass and matching fragment ions from both termini are 
usually sufficient to accomplish a gene-specific identification*'”. Here, 9 
of the approximately 15 isoforms of histone H2A were fully character- 
ized in an automated fashion despite their greater than 95% sequence 
identity (including the H2A.Z and H2A.X variants) with an additional 
three having Am greater than 1 Da (H2A type 1-D, 2-C and 2-B). Also 
identified were nine S100 proteins, several «- and B-tubulins, seven 
unique isoforms of human keratin (a widely known contaminant in 
proteomics), MLC20, BTF3 and their related sequences (which are 97% 
and 81% identical, respectively (Supplementary Figs 4 and 5)), and over 
100 isoforms/species from the high mobility group (HMG) family (see, 
for example, Fig. 4). Significant improvements for top-down proteo- 
mics in discovery mode were made for proteins in the 40-110 kDa 
range (Fig. 3d), including extensive characterization of GRP78, a 
70.6-kDa heat-shock protein (more than 12 fragment ions mapping 
to each terminus, Supplementary Fig. 6), and identification of several 
proteins greater than 90 kDa, such as P33991 and Q14697 at 97 and 
104 kDa, respectively (Supplementary Table 2). 
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c, Histogram showing the distribution of q values for the identified proteins. 
d, Plot showing the molecular mass distribution for the unique identifications 
obtained. The line graph depicts the theoretical molecular mass distribution for 
the human proteome (Swiss-Prot, Homo sapiens, 20,223 entries). 


Because the two-dimensional liquid electrophoresis platform makes 
use of SDS extensively, we anticipated reduced bias against integral 
membrane proteins. In all, 32% of the 1,043 total identifications from 
HeLa cells were membrane-associated proteins (GO:0016020), with 
62% of these annotated as integral membrane proteins (GO:0016021, 
Supplementary Table 4). A more focused study of a mitochondrial 
membrane fraction (see Methods) used chromatographic procedures 
modified for enhanced separation of membrane proteins. We iden- 
tified an additional 46 integral membrane proteins (Supplementary 
Table 3) from a single three-dimensional experiment (no isoelectric 
focusing). Detailed inspection of the species that eluted from the col- 
umn during LC-MS revealed proteins with a distribution of 1-11 
transmembrane helices (Supplementary Table 3). This shows a broad 
applicability of this study and will drive further efforts to detect full- 
length isoforms of membrane proteins”. 

As part of our study of the HeLa proteome, cells were treated with 
etoposide to elicit the DNA damage response (see Methods), followed 
by four-dimensional fractionation and top-down tandem mass 
spectrometry. Using gene ontology (GO) analysis, we annotated all 
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Figure 4 | Monitoring dynamics of HMGAI isoforms during senescence in 
B16F10 and H1299 cells. After induction of DNA damage by transient 
treatment with camptothecin for H1299 cells or etoposide for B16F10, 
progression of accelerated senescence was monitored by SA-B-Gal (a, b) or 
DAPI staining to monitor formation of senescence-associated heterochromatic 
foci (SAHF) (c, d) over the specified recovery period. Changes in modification 
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profiles on HMGAIa (e, f) and HMGA1b (g, h) from B16F10 showed mild 
increases in phosphorylation occupancy but a significant increase in 
methylation levels on multiply phosphorylated species. A more striking 
increase in both methylation and phosphorylation was observed in senescent 
H1299 cells (i, j). No such methylations were observed in the HMGA1b profiles 
for either cell line. 
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four-dimensional identifications according to cell compartment 
(Fig. 3b) or biological process (Supplementary Fig. 7). Many proteins 
detected were involved in cell cycle regulation and apoptosis, including 
nine that interact with proliferating cell nuclear antigen during repair 
of DNA damage (Supplementary Fig. 8). Also, several proteins involved 
in the Fanconi anaemia pathway were identified including FANCE, 
RADS1AP1, RAD23B and RPA3, with the last two completely char- 
acterized (Supplementary Table 5). Several CDK inhibitors were found, 
such as p27“?! (CDKNIB) and p16'““ (CDKN2A), T53G1 and the 
protein product from a target gene of p53 (Q9Y2A0, p53-activated 
protein 1). 

Using the three-dimensional fractionation approach (that is, 
GELFrEE-nanocapillary LC-MS) to readout phosphorylation stoichi- 
ometry with high fidelity (Supplementary Information and Sup- 
plementary Fig. 9), we monitored 17 phosphoprotein targets across 
three time points at three different concentrations of etoposide (Sup- 
plementary Table 6). We found increases in the occupancy of phos- 
phorylation in H2A.X-pSer139 (yH2A.X) after treatment with 25 or 
100 1M etoposide for 1 h (Supplementary Fig. 10). After a 24-h recovery 
from treatment, a return to basal levels of phosphorylation of yYH2A.X 
was found, consistent with engagement of the DNA-repair machinery™*. 
Further, we observed a strong correlation between the phosphorylation 
stoichiometry of yYH2A.X determined by mass spectrometry with the 
results from immunofluorescence and western blotting run in parallel 
(Supplementary Fig. 10a-c). 

In separate studies we tracked over 2,300 species (from 690 proteins) 
in H1299 cells (Supplementary Table 7) and 2,300 species (from 708 
proteins) in B16F10 melanoma cells (Supplementary Table 8) in the 
days after a 24-h treatment with camptothecin or 5h of etoposide, 
respectively, using only the three-dimensional fractionation approach. 
After induction of DNA damage, we also monitored the classic hall- 
marks of stress-induced senescence in H1299 (ref. 25) and B16F10 
(ref. 26) over several days (Supplementary Fig. 1la-c), including cell 
enlargement and formation of senescence-associated heterochromatic 
foci (Supplementary Fig. 11d-f). Although levels of yYH2A.X remained 
the sameas in control cells, a striking upregulation in methylated forms 
of di- and tri-phosphorylated HMGA 1a, but not of its splice variant 
HMGA\I1b was observed as both B16F10 and H1299 cells entered 
stress-induced senescence (Fig. 4 and Supplementary Fig. 11g-l). 

Full descriptions of the fragmentation data for two multiply 
modified species of HMGAI are presented in Supplementary Fig. 12. 
In mapping these species, the hierarchy of phosphorylations on 
HMGA Ia was determined for control cells to be Ser 101 and Ser 102 
occupied in the 2P; form and evidence for the third site pointing pre- 
dominantly towards pSer 98. The 3P; and 4P; forms both showed some 
occupancy for pSer 43 (data not shown), a site only available in the 
splice region specific to the HMGA 1a variant (Supplementary Fig. 12). 
For day 5 in senescent H1299 cells, the effect on methylation was 
particularly dramatic, with both the mono- and di-methylated species 
(also harbouring multiple phosphorylations) reproducibly increased 
to be greater than 80% of the total signal for species from the hmga1 
gene (Fig. 4 and see Supplementary Fig. 13 for biological replicates). 
The methylation site was localized precisely to Arg 25 (Supplementary 
Fig. 12), consistent with previous work on HMGAI proteins”. A 
similar response for methylated HMGA species has been observed in 
damaged cancer cells undergoing apoptosis””’® but the B16F10 and 
H1299 cells prepared here were clearly senescent as measured by 
annexin V staining and fluorescence-activated cell sorting analysis 
through day 6 (data not shown). As Arg25 is in the first AT- 
hook DNA-binding region (residues 21-31), it is possible that the 
R25mel and R25me2 marks perturb DNA-kinking and allow 
HMGA Ia to be preferentially incorporated into senescence-associated 
heterochromatic foci” during accelerated cellular senescence. Other 
changes in bulk chromatin were also notable, such as hypoacetylation 
on all core histones, increased levels of H3.2K27me2/3, and decreased 
H3.2K36me3. 
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The sharp increase in proteome coverage demonstrated here pro- 
vides a path ahead for interrogating the natural complexity of protein 
primary structures that exist within human cells and tissues. Because 
this is the first time top-down proteomics has been achieved at this 
scale, an early glimpse at the prevalence of uncharacterized mass shift- 
ing events has been revealed in the human proteome. With faithful 
mapping of intact isoforms on a proteomic scale, detecting co-variance 
in modification patterns will help lay bare the post-translational logic 
of intracellular signalling. Also, proper speciation of protein molecules 
offers the promise of increased efficiency for biomarker discovery 
through stronger correlations between measurements and organismal 
phenotype (for example, a particular isoform of apolipoprotin 
C-III and levels of high-density lipoprotein in human blood’). 
Technology for intact protein characterization could also become a 
central approach to focus an analogous effort to the human genome 
project—to provide a definitive description of protein molecules pre- 
sent in the human body”. 


METHODS SUMMARY 


For large-scale global analysis, HeLa S3 cells were pre-fractionated using custom 
two-dimensional liquid electrophoresis platform, comprising sIEF coupled to 
multiplexed GELFrEE’'’. HeLa S3, H1299, B16F10 cells and mitochondrial 
membrane proteins were also fractionated using the custom GELFrEE'’ device 
alone (no sIEF). After separation, detergent and salt were removed, and the frac- 
tions were injected into nanocapillary reversed-phase liquid chromatography 
(RPLC) columns for elution into a 12T linear ion trap Fourier-transform mass 
spectrometer for online detection and fragmentation’*’*. The mass spectrometry 
RAW files were processed with in-house software called CRAWLER to assign 
masses. Using this program, determination of both the intact masses and the 
corresponding fragment masses was performed and these data were searched 
against a human proteome database. Extensive statistical workups were also per- 
formed using several false discovery rate (FDR) estimation approaches (with decoy 
databases both concatenated and not). A final q value procedure is described in 
detail (Methods), with the data above reported using a 5% instantaneous FDR 
(that is, q value) cutoff at the protein level (Supplementary Fig. 14). 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Cell culturing and treatments. HeLa S3 cells (ATCC CCL-2.2) were grown in 
Joklik’s modified minimal essential medium. HeLa cells were grown in suspension, 
whereas B16F10 mouse melanoma cells (ATCC CRL-6475) and H1299 small lung 
carcinoma cells (ATCC CRL-5803, in DMEM) were grown adherently. Media 
were supplemented with 10% newborn calf serum and 1% penicillin and 
streptomycin. Cells were maintained in 5% CO, at 37 °C, harvested (after adding 
0.05% trypsin and EDTA for B16F10 and H1299 cells) by centrifugation at 200g 
for 5 min, and washed twice with PBS. 

For experiments using HeLa with intrinsic DNA damage, cells were treated for 

1 or 5h using 1 [tM etoposide (four-dimensional platform), or for 1h with 25 or 
100 uM etoposide for targeted experiments (three-dimensional platform, Sup- 
plementary Fig. 10 and Supplementary Table 6)*'. Repair of DNA damage was 
monitored after placing treated (25 1M) cells into fresh media for 24h before 
harvesting. B16F10 cells were treated with 10 1M etoposide for 5h at 20-30% 
confluence, and allowed to grow in normal media. Similarly, H1299 cells were 
treated with 25nM camptothecin for 24h. Over several days, these treatments 
induced stress-associated, accelerated senescence as monitored by” a flattened 
and enlarged cell morphology, expression of senescence-associated -galactosidase 
(SA-B-gal), formation of senescence-associated heterochromatic foci and upregu- 
lation of p53 (in B16F10 cells only). Approximately 2 X 10’ senescent cells were 
collected, lysed and subjected to the three-dimensional platform. 
Preparation of HeLa S3 cytosolic, nuclear and whole-cell extracts. For large- 
scale mapping using the four-dimensional platform, the HeLa cytosolic and 
nuclear extracts were prepared through a protocol by Trinkle-Mulcahy et al.*’. 
After isolation, the pelleted fraction containing nuclei was redissolved using 4% 
SDS (50mM Tris, pH 7.5, with protease, phosphatase inhibitors and sodium 
butyrate). Both cytosolic and nuclear fractions were disrupted using a sonication 
probe. The nuclear fractions were further homogenized using QIAShredder 
homogenizer spin columns to reduce viscosity (Qiagen). All fractions were cen- 
trifuged at 14,000g for 10 min at 4°C. Protein concentrations were determined by 
bicinchoninic acid and stored at —80 °C. 

For the three-dimensional platform, whole-cell extracts were re-suspended in 

5 ml of lysis buffer (4% SDS, 100 mM Tris-HCl pH 7.5, 10 mM DTT with protease, 
phosphatase inhibitors and sodium butyrate). The mixture was vortexed for 5 min 
and boiled for 10 min. Immediately after boiling, the samples were alkylated in the 
dark with 100 mM iodoacetamide for 20 min. 
Western blots, imaging and microscopy. Western Blots. Antibodies used were 
Histone H2A.X-pSer139 (Cell Signaling Technology; 2577S), GAPDH (Santa Cruz; 
sc-47724) and HRP-conjugated secondary antibodies. Chemiluminescence was 
detected using ChemiDoc XRS+ (Bio-Rad Laboratories) and band densities were 
calculated using Image Lab software (Bio-Rad Laboratories). 

For yH2A.X imaging by immunofluorescence, cells were fixed in 0.1% 
glutaraldehyde and 3% formaldehyde made fresh from paraformaldehyde fol- 
lowed by permeabilization in 0.5% Triton X-100. Cells were subsequently incu- 
bated for 1h at 25°C with 3% BSA and then with the primary antibody against 
Histone H2A.X-pSer139 (1:400). Alexa Fluor Conjugates (Molecular Probes) were 
used as the secondary antibodies (1:1,000). Images were obtained using an inverted 
Zeiss Axio Observer.Z1 confocal microscope. 

For {-Gal staining, cells were fixed for 5min in 2% formaldehyde/0.2% 
glutaraldehyde in PBS, washed and incubated at 37 °C (no CO,) overnight with 
X-Gal (5-bromo-4-chloro-3-indolyl B-p-glactosidase) staining solution (1 mg of 
X-Gal in 40 mM citric acid/sodium phosphate pH 6.0, 5 mM potassium ferrocyanide, 
5 mM potassium ferricyanide, 150 mM NaCl, 2mM MgCl). 

For DAPI staining, cells were fixed in 4% paraformaldehyde for 30 min, washed 
and incubated with 4’ ,6’-diamidino-2-phenylindole (DAPI) solution (10 pg ml ty 
for 10 min. The images of DAPI-stained DNA were obtained using the Zeiss Axio 
Observer.Z1 confocal microscope. 

Sample handling and multidimensional protein fractionation. Fractionation 
using four-dimensional fractionation (two-dimensional liquid electrophoresis and 
LC-MS). HeLa proteins (0.5-2 mg) were reduced, alkylated, precipitated with cold 
acetone and re-suspended in 3.2 ml sIEF buffer (8 M urea, 2 M thiourea, 50 mM 
DTT, 1% w/v Biolyte 3/10 carrier ampholytes from Bio-Rad Laboratories). The 
sample was focused using a custom designed eight-chamber sIEF system as previ- 
ously described**. After complete focusing (approximately 1.5 h at 2 W), the liquid 
fractions (400 tl) were collected and combined with their respective chamber rinse 
solution (100 ul of 1% SDS). Adjacent sIEF fractions (including anode and cath- 
ode) were pooled, resulting in about five fractions that were precipitated using cold 
acetone. The precipitated proteins were re-suspended in approximately 50 pl of 
Laemmli loading buffer**. These fractions were then fractionated in parallel using a 
custom ten-channel multiplexed GELFrEE device'*. Tube gels were cast to 12% T 
(1cm length) for the resolving and 4% T for the stacking gels (300 ,1l volume). 
Application of 240 V for approximately 1h resulted in eight or nine GELFrEE 
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fractions (150 pl) per IEF fraction after elution of the dye front. After complete 
electroelution, the two-dimensional liquid electrophoresis fractions underwent 
SDS removal using chloroform/methanol/water precipitation as described previ- 
ously*®. Before nanocapillary RPLC injection, fractions were re-suspended by 
pipetting vigorously with 15-40 ll solvent A (5% acetonitrile, 0.2% formic acid). 

For experiments using the three-dimensional platform, only GELFrEE coupled 
to nanocapillary LC-MS was used for sample fractionation. Whole cell lysates, 
mitochondrial membrane preparations or extracts targeting modified proteins 
were re-suspended as described above and fractionated using a single-channel 
GELFYEE device’’. Nanocapillary LC-MS conditions for large-scale analyses were 
as described below. 

For nanocapillary RPLC-MS, in either the four- or three-dimensional platform, 
re-suspended fractions were injected (10 11) onto a trap column (150 tm X 2 cm) 
using an autosampler (Eksigent). The nanobore analytical column (75 jum X 10 cm) 
containing an integral fritted nanospray emitter (PicoFrit, New Objective) was 
coupled to the trap in a vented column tee setup. Both the analytical and trap 
columns contain polymeric reversed-phase (PLRP-S, Phenomenex) media (5 jum, 
1,000A pore size). The Eksigent 1D Plus nano-HPLC system was operated at a flow 
rate of approximately 2 l min! for 10 min for loading onto the trap. The proteins 
were eluted into the mass spectrometer using a flow rate of 300 nl min” ' with the 
following gradient: 5% B (95% acetonitrile + 0.2% formic acid) at 0 min; 20% B at 
5 min; 55% B at 50 min; 85% B at 55 min; 5% B at 65 min; 5% B at 75 min. 

For proteins fractionated from HeLa $3 and B16F10 cells, the nanocapillary 

RPLC column was coupled online to a 12T LTQ FT Ultra mass spectrometer 
(Thermo Fisher Scientific) fitted with a digitally controlled nanospray ionization 
source (PicoView DPV-550, New Objective). For masses up to 25 kDa (as deter- 
mined from GELFrEE fractions), precursor mass data were collected using the 
Fourier-transform ion cyclotron resonance (eight microscans, 170,000 resolving 
power at m/z = 400) with an m/z range of 500-1,800 and a target value of 1 million 
charges. For masses either greater than 25 kDa or greater than 50 kDa, precursor 
mass data were obtained using the ion trap at 20 or 50 microscans, respectively. 
Data from H1299 cells were obtained on an Orbitrap Elite mass spectrometer 
(Thermo Fisher Scientific) using two to four microscans, 108,000 or 216,000 
resolving power at m/z = 400, and a target value of 1 million charges. 
Mass spectrometry data acquisition for targeted monitoring of intact 
isoforms/species. For three-dimensional experiments on targets up to 25 kDa, 
precursor mass data were collected with Fourier-transform ion cyclotron res- 
onance parameters as described above. Data-dependent ‘zoom mapping’ was 
performed using a top three (no fragmentation) acquisition strategy with 60 
m/z isolation window, three microscans at 85,000 resolving power (target value 
of 2 million charges with 60 m/z isolation at zero collision energy, or SIM mode). 
Dynamic exclusion was enabled with a repeat count of 2, an exclusion duration of 
5,000 s, and a repeat duration of 240s. ‘Mass mode’ was enabled in the Xcalibur 
software to ensure that each zoom map scan detected a different protein species. 
Preparation of mitochondrial membrane proteins for top-down mass 
spectrometry. A HeLa $3 cell pellet consisting of approximately 10” cells was 
re-suspended in about 16ml STM buffer (250mM sucrose, 50mM Tris-HCl, 
pH 7.4, 5mM MgCl, 10mM sodium butyrate, 1mM DTT, 1% protease and 
phosphatase inhibitors (Sigma-Aldrich)). Cells were lysed using a glass Dounce 
homogenizer. The lysate was centrifuged at 800g for 15 min to remove nuclei and 
cellular debris. Mitochondria membrane isolation was performed as described 
previously”’. Briefly, the cell lysate was centrifuged at 6,000g for 15 min. The pellet 
was washed with STM buffer and the centrifugation step repeated. Mitochondria 
were re-suspended in 2 ml lysis buffer (10 mM HEPES, pH 7.9, 10 mM sodium 
butyrate, 1 mM DTT, 1% protease and phosphatase inhibitors) and stirred at 4 °C 
for 30 min before sonication. The suspension was centrifuged at 9,000g for 30 min. 
The pellet was re-suspended in 0.5 ml extraction buffer (20 mM Tris, pH 7.8, 0.4M 
NaCl, 15% glycerol, 5% SDS, 10mM sodium butyrate, 1mM DTT, 1% protease 
and phosphatase inhibitors) and vortexed for 30 min. The sample was centrifuged 
at 9,000g for 30 min and the supernatant was collected, aliquoted, flash frozen and 
stored at —80 °C until use. 

For GELFrEE separation and LC-MS of integral membrane proteins, mito- 
chondrial membrane proteins (400 jug as determined by bicinchoninic acid) were 
acetone precipitated, re-suspended in 100 il loading buffer, reduced with 20 mM 
DTT and alkylated with 100 mM iodoacetamide. After a GELFrEE separation as 
reported previously’, SDS was removed” and the fractions re-suspended in 30 pl 
of fresh 60% formic acid. Each 10-l fraction was injected onto a PLRP analytical 
column (75 um X 10cm) and heated to 45 °C using a nanocapillary column heater 
(New Objective). A solvent system developed for integral membrane proteins was 
used in nanocapillary format here: A, 60% formic acid in water; B, 100% isopro- 
panol. The gradient used was 0% B at 0 min, 25% B at 5 min, 60% B at 50 min, 95% 
B at 56 min. Ten GELFrEE samples were also analysed by the standard acetonitrile 
solvent and gradient system described above. 
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TMHMM version 2.0 (http://www.cbs.dtu.dk/services/TMHMM) was used for 
the transmembrane domain prediction of identified proteins. The SwissProt acces- 
sion number for each identification was entered into the server to determine the 
number and location of all transmembrane domains. Differences between the 
database sequence and that identified by top-down mass spectrometry were 
accounted for by interrogating the identified sequence with TMHMM. 

Protein identification and characterization. Mass spectrometry fragmentation 
data were acquired in three different modes, depending on protein mass range. 
Below 17kDa, data-dependent collision-induced dissociation (CID) was used 
(FT/FT), whereas source-induced dissociation (SID) was used for masses between 
17 and 25kDa (FT/FT) and above 25 kDa (ion trap/FT). Based on preliminary 
analyses, SID of 15 V was optimal for ion trap scans for the dissociation of weakly 
bound non-covalent adducts, whereas the 75V SID for fragmentation was 
standardized as described’*. For data-dependent fragmentation (top two precur- 
sors, 15-25 m/z isolation window), dynamic exclusion was enabled with a repeat 
count of 2, an exclusion duration of 5,000 s, and a repeat duration of 240s. Both 
CID and SID were collected using eight microscans and 85,000 resolving power (at 
m/z = 400) with a target value of 2 million charges. Data for H1299 cells obtained 
on the Orbitrap Elite were acquired using either higher-energy collisional dissoci- 
ation, CID or electron transfer dissociation during top three data-dependent 
tandem mass spectrometry fragmentation using mass mode, four microscans, 
108,000 resolving power and a target value of 1 million charges (cf. Fig. 4i, j, 
Supplementary Figs 12 and 13 and Supplementary Table 7). 

Software and data analysis. Much of the software for intact mass determination 
(kKDECON), generation of visual outputs (Proteome Display), and species differ- 
entiation (PTMcrawler) has been published recently’”. Briefly, KDECON provides 
average mass information for charge state resolved distributions using the ion trap. 
PTMcrawler traverses a list of intact monoisotopic masses to find mass differences 
corresponding to PTMs such as methylations, acetylations and phosphorylations 
(Fig. 2b). Proteome Display was devised for visualization of nanocapillary LC- 
tandem mass spectrometry data, allowing graphical viewing of four-dimensional 
proteome runs and generation of plots for specific PTMs (Fig. 2). 

The RAW files collected were first processed with an algorithm called 
CRAWLER to assign masses. Using a version of this program, scans collected 
by Fourier-transform mass spectrometry were de-isotoped using the Xtract or 
THRASH algorithm’”’. Scans collected in the ion trap were deconvoluted using 
the kDECON algorithm (minimum intensity cutoff 1,000; mass range 10-70 kDa). 
Xtract or THRASH processing generated monoisotopic neutral masses, whereas 
kDECON processing provided average neutral masses. Data-dependent frag- 
mentation scans were summed within a retention time tolerance of 1 min and a 
precursor tolerance of 0.05 m/z whereas SID scans were summed in 0.3 min. In 
both cases, multiplexed fragmentation was considered. Fragmentation data were 
filtered by selecting the top three most intense neutral fragment masses within a 
100-Da window below 2,000 Da, and the top five above 2,000 Da. One or more 
precursor masses and one or more fragment masses resulting from each summed 
unit were grouped as ProSightPC experiment, and written to a ProSight Upload 
Format file. If a precursor mass could not be determined, the experiment was 
written out to a separate file with a placeholder value, for separate analysis. 

The ProSight Upload Format files output by CRAWLER were searched against a 
human proteome database using a custom implementation of ProSightPC 2.0 with 
iterative search logic on a 168-node Rocks” cluster. Four types of analysis were 
run, depending on the type of data: FT/FT CID, FT/FT SID, ion trap/FT SID or SID 
data where the precursor could not be determined (‘No-Hi-SID’). The iterative search 
trees were designed to take advantage of high mass accuracy, while retaining the 
option to run less specific searches ifa result of sufficient quality could not be obtained 
by more specific searches. All searches used 10 p.p.m. tolerance for the fragment ions, 
all of which were obtained at high resolving power. For each search, the top ten hits 
were returned; if the top hit had an E value not more than 1 X 10”, the analysis 
moved on to the next experiment, otherwise the next search in the tree was run. All 
searches were in absolute mass mode. The FT-FT-CID tree consisted of searches at 
200, 2,000 Da, and ‘entire database’ precursor tolerances; the FT-FT-SID at 2.3, 
2,000, 20,000 and entire database; and ion trap-FT-SID trees used 2,000, 20,000 
Da, and ‘entire database’ precursor tolerances; the No-Hi-SID tree just searched 
against the entire database. Searches were against two different human proteome 
databases built against UniProt release 2011_04, encompassing known alternative 
splices, modifications, peptide cleavage events, potential initial methionine cleavage 
and amino (N)-terminal acetylation. A complex database was created encompassing 
combinations of annotated alternative splice and peptide cleavage events to generate 
54,190 base sequences. A maximum of 2'° protein forms for each base sequence 
make approximately 8,450,000 theoretical protein species. This database was used for 
all searches where the precursor tolerance was less than 2,000 Da. All other searches 
used a simplified database consisting of the same 54,190 base sequences, modified 


with N-terminal acetylation and initial methionine cleavage (where applicable) 
creating a total of approximately 160,000 forms. 

Data were run against both forward and scrambled databases (see FDR estima- 
tion below) separately with identical search parameters. Upon completion, all 
search results were loaded into a ProSight data repository and a report was pro- 
duced, returning the top hits for each experiment. The hit with the best q value was 
then chosen as the exemplar for each gene product cluster. If two member hits had 
the same q value, the member with the lowest absolute mass difference to the 
theoretical hit was chosen, and if this still produced a set with more than one 
member, the form with the ‘most-terminal’ PTMs (that is, closest to the N or 
carboxy terminus) was chosen. For a desired FDR cutoff (for example, 5%), a list of 
accession numbers and species is produced. 

For estimation of FDR for top-down proteomics, the Poisson-based model as 
published in 2001 (ref. 43) had been previously modified with a Bonferroni cor- 
rection that enables probability-based scoring (that is, use of the E value noted 
above) for searches done on a database created by shotgun annotation’®. To 
validate the process used in high-throughput operations, an FDR analysis was 
performed to correct for multiple hypothesis testing using the method of 
Benjamini and Hochberg” as applied by Storey**. For each Poisson-based p value, 
a corresponding Bayesian posterior p value is calculated, termed the q value, which 
is a measure of the FDR for that particular identification event (also called an 
instantaneous FDR). 

To calculate q values, a separate decoy database of scrambled sequences was 
created equal in size to the forward (real) database*®. Searches were done separately 
on both the forward and decoy (scrambled) databases using all data in a set of four- 
or three-dimensional proteome runs. A histogram was created for the decoy 
database results using the log pjq value (where p;q is the Poisson probability of 
an incorrect protein identification) of these false identifications (Supplementary 
Fig. 14). These data were modelled against a gamma distribution and fit with a 
shape (k) of 10.26 + 0.04 and rate () of 2.25 + 0.01. The distribution of scrambled 
hits is taken as an empirical estimate of the distribution of scores under the null 
hypothesis that the match was due to chance. Thus, the area under the scrambled 
score distribution to the right of the observed forward score is the probability of 
getting this good a forward score, or better, by chance (abbreviated hereafter p,). 
From here, all data are rank-ordered by their corresponding p; values and q values 
are calculated as in Storey**. The final results were generated using a q value cutoff 
of 0.05, thus achieving a protein level FDR of 5%. For comparison, bottom-up 
studies typically use a 1% FDR cutoff at the peptide level (the so-called PSM level, 
for peptide spectrum match), which typically rolls up into a 5-8% FDR at the 
protein level’’. Extensive comparison with other FDR estimation techniques 
(including generating ROC curves for reversed decoy databases of concatenated 
reversed sequences) showed the q value approach to be 10-25% more stringent in 
terms of number of identified proteins and species. 
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The endonuclease activity of Mili fuels piRNA 
amplification that silences LINE] elements 


Serena De Fazio!, Nenad Bartonicek?*, Monica Di Giacomo!, Cei Abreu-Goodger’, Aditya Sankar’, Charlotta Funaya’, 
Claude Antony*, Pedro N. Moreira’, Anton J. Enright? & Donal O’Carroll' 


Piwi proteins and Piwi-interacting RNAs (piRNAs) have conserved 
functions in transposon silencing’. The murine Piwi proteins Mili 
and Miwi2 (also called Piwil2 and Piwil4, respectively) direct epi- 
genetic LINE1 and intracisternal A particle transposon silencing 
during genome reprogramming in the embryonic male germ 
line’*. Piwi proteins are proposed to be piRNA-guided endo- 
nucleases that initiate secondary piRNA biogenesis” ’; however, 
the actual contribution of their endonuclease activities to piRNA 
biogenesis and transposon silencing remain unknown. To investi- 
gate the role of Piwi-catalysed endonucleolytic activity, we engi- 
neered point mutations in mice that substitute the second aspartic 
acid to an alanine in the DDH catalytic triad of Mili and Miwi2, 
generating the Mili?” and Miwi2”“" alleles, respectively. Analysis 
of Mili-bound piRNAs from homozygous Mili?" fetal gonado- 
cytes revealed a failure of transposon piRNA amplification, result- 
ing in the marked reduction of piRNA bound within Miwi2 
ribonuclear particles. We find that Mili-mediated piRNA amplifica- 
tion is selectively required for LINE1, but not intracisternal A par- 
ticle, silencing. The defective piRNA pathway in Mili?“" mice 
results in spermatogenic failure and sterility. Surprisingly, homo- 
zygous Miwi2?4" mice are fertile, transposon silencing is estab- 
lished normally and no defects in secondary piRNA biogenesis are 
observed. In addition, the hallmarks of piRNA amplification are 
observed in Miwi2-deficient gonadocytes. We conclude that cycles 
of intra-Mili secondary piRNA biogenesis fuel piRNA amplification 
that is absolutely required for LINE] silencing. 

Transposable elements are mobile genetic elements that constitute a 
large fraction of eukaryotic genomes. The process of transposon silen- 
cing is of fundamental importance for genome integrity and germ cell 
development. Members of the Piwi subclade of the Argonaute proteins 
have conserved roles in transposon silencing and bind a class of small 
non-coding RNAs known as piRNAs that act as guides for targeting of 
the respective ribonuclear particles (RNPs)'. The Argonaute family is 
primarily defined by the presence of the Piwi domain that adopts a 
classical RNase H fold* with some Argonaute proteins being active 
small-RNA-guided endonucleases (slicers)’. Mechanisms of piRNA 
biogenesis are largely unclear, but two models summarize our current 
knowledge. First, primary processing of long single-stranded precursors 
by unknown nuclease(s) results in the generation of primary piRNAs. 
Thereafter, secondary biogenesis takes place via a ‘ping-pong’ cycle; 
that is, a feed-forward amplification loop wherein the slicer activities 
of two Piwi proteins take turns to generate secondary piRNAs using 
primary piRNAsas initial guides*®. Originally described in Drosophila, 
where all three Drosophila Piwi proteins have proven endonuclease 
activity in vitro®”°, this secondary pathway model offers an explana- 
tion for transposon silencing as piRNA biogenesis consumes trans- 
poson transcripts”®. Nevertheless, the role of Piwi-mediated slicing in 
secondary biogenesis or transposon silencing is not directly tested in 
any system. 


In mice, transposable elements are epigenetically silenced throughout 
most of life with transposon repression being initially established 
during germ cell development'’. The process of reprogramming in 
the fetal germ line initiates genome-wide CpG demethylation that 
erases both genomic imprints as well as retrotransposon promoter 
methylation’’. In male mice the Piwi proteins Mili and Miwi2 are 
essential for de novo DNA methylation and repression of (L1) and 
intracisternal A particle (IAP) retrotransposons**. The mouse 
piRNA model proposes that transposon-derived primary Mili-bound 
sense piRNAs initiate a round of secondary processing by targeting 
complementary antisense transposon transcripts for cleavage by Mili’s 
endonuclease activity’. The 3’ cleavage fragment is then 3’ processed, 
which results in the accumulation of secondary antisense piRNAs in 
Miwi2 that guide transposon DNA methylation. Furthermore, these 
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Figure 1 | The endonuclease activity of Mili is required for spermatogenesis 
and LI silencing. a, Testicular atrophy in Mili”“” mice. Testicular weights of 
3-month-old wild-type (WT) and Mili?“ mice are shown. b, Haematoxylin- 
and eosin-stained wild-type and Mili?“” testis sections from 3-month-old 
mice. The percentage of Mili?“ mice with the indicated phenotype is shown. 
c, Immunofluorescence using anti-L1 Orfl and anti-IAP Gag antibodies 
(green) and DAPI-stained DNA (blue) on wild-type and MiliP“" £16.5 fetal 
testis sections are shown. d, Western blot (WB) using anti-L1 Orfl and anti- 
tubulin antibodies on extracts from P10 wild-type and Mili”*" testes is shown. 
e, The expression levels of L1 and IAP were quantified by gRT-PCR from RNA 
derived from P10 wild-type and Mili”“” testes. Error bars indicate standard 
deviation from biological triplicates (n = 3). f, Methylation-sensitive Southern 
blot on Hpall-digested DNA extracted from P10 wild-type and Mili?“" testis 
using a L1 promoter probe. The arrowhead indicates the identity of the 
methylation-sensitive fragment. 
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antisense piRNA-programmed Miwi2 RNPs are also proposed to target 
transposon transcripts for cleavage followed by another round of 
secondary piRNA biogenesis that serves to amplify the primary trans- 
poson piRNAs’. Here we explore the contribution of the putative 
endonuclease activities of Mili and Miwi2 to the embryonic piRNA 
pathway and transposon silencing. 

We decided to use the same genetic strategy that has recently 
been used to explore the in vivo endonuclease function of Argonaute 
2 (Ago2)'*"*. Mutation of the second aspartic acid to an alanine in the 
catalytic triad DDH (the Ago2”° or Ago2”“" mutation/allele) abro- 
gates Ago2’s catalytic function without affecting protein stability and 
other functions of the protein in vivo’*'®. We first generated knock-in 
Mili?894/?8194 or Mili?“" mice (Supplementary Fig. la-c). The point 
mutation did not have an impact on Mili protein expression (Sup- 
plementary Fig. 1d). Mili localizes to the inter-mitochondrial cement, a 
part of the germ-cell-specific perinuclear nuage’’, and has been shown 
to interact with the Tudor domain protein Tdrd1 as well as the RNA 
helicases Mov10L1 and Mvh (refs 18-22). Co-localization of Mili with 
Tdrdl, Movl0L1 and Mvh was observed in fetal gonadocytes of 
wild-type and Mili?“ mice (Supplementary Fig. 2). Furthermore, 
interaction of Mili with Tdrd1 and Mov10L1 was confirmed in both 
wild-type and Mili?“" mice (Supplementary Table 1). In contrast to 
Mili, Tdrd1 or Mvh deficiency, normal inter-mitochondrial 
cement was observed in Mili?“" gonadocytes (Supplementary Fig. 3). 
Therefore, the Mili?“" point mutation does not have an impact on 


inter-mitochondrial cement integrity, expression levels or localization 
of Mili. 

Deletion of Mili or Miwi2 leads to arrest in meiotic prophase*”’, 
probably attributable to secondary consequences of defective trans- 
poson DNA methylation and de-repression. All Mili?“” testes were 
atrophic (Fig. la), with most (70%) of MiliP“" mice presenting the 
same meiotic phenotype as Mili ‘~ mice. Early pachytene was the 
most advanced stage of germ cell development observed in these mice 
(Fig. 1b and Supplementary Fig. 4). The other 30% of Mili?” mice 
presented a slightly milder phenotype. Tubules containing round and 
even elongating spermatids were observed, albeit with cells of aberrant 
morphology (Fig. 1b and Supplementary Fig. 5). All Mili?“ tubules 
were highly apoptotic (Supplementary Figs 4b and 5c). However, inde- 
pendent of the phenotype observed, all Mili?4" male mice are sterile 
(Supplementary Fig. le). The process of transposon de novo DNA 
methylation occurs during late gestation in mice, concluding in postnatal 
germ cells a few days after birth. De-repression of L1 elements was 
observed in Mili?“" embryonic day (E)16.5 gonadocytes and postnatal 
and adult germ cells (Fig. 1c-e and Supplementary Fig. 6a, b). With 
respect to the process of transposon silencing, no heterogeneity in adult 
Mili?" mice was observed (Supplementary Fig. 6), raising the 
possibility that the distinct adult spermatogenic phenotypes may arise 
from a genetic background effect or that Mili and its associated endo- 
nuclease activity may function in processes beyond transposon silen- 
cing in meiotic or post-meiotic cells. Notably, in contrast to Mili ‘~ 
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Figure 2 | piRNA amplification failure in Mili?“” mice. a, Mili RNPs were 
immunoprecipitated from E16.5 fetal testis of the indicated genotypes. 
Associated piRNAs were visualized by 5’ **P labelling, resolution on a 15% TBE 
urea gel and autoradiography. nt, nucleotide. b, Size profiles of cloned Mili- 
bound piRNAs from biological replicates of wild-type (blue) and Mili?P4¥ 
(pink) E16.5 fetal gonadocytes. c, Genomic annotation of cloned Mili-bound 
piRNAs as indicated from pairs of biological replicates of wild-type and 
MiliP4” £16.5 fetal gonadocytes. LTR, long terminal repeat; ncRNA, non- 
coding RNA. d, Mapping of Mili-bound piRNAs to the consensus of L1 (left) 
and IAP (right) elements. Positive and negative values indicate sense and 
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Distance between 5’ ends (nt) 


antisense piRNAs, respectively. Schematic representations of the respective 
elements are also shown (above). e, Percentage of piRNAs from wild-type and 
Mili?“" Mili RNPs with a U at the first position (1U) without A at position 10 
and an A at position 10 (10A) without a U at position 1 are shown for L1 and 
IAP elements. Error bars indicate standard error of the mean from the 
biological duplicates (n = 2). f, Ping-pong analysis of Mili-bound piRNAs from 
biological replicates of wild-type and Mili”“" E16.5 fetal gonadocytes. Relative 
frequency (y axis) of distances between 5’ ends (x axis) between complementary 
piRNAs for both L1 and IAP elements is shown. nt, nucleotides. 
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mice*“, IAPs are normally repressed in Mili?“” mice (Fig. 1c-e and 


Supplementary Fig. 6a, b). Accordingly, defective CpG DNA methyla- 
tion of L1 but not IAP elements was observed in postnatal germ cells at 
time points after the completion of the de novo DNA methylation 
process (Fig. 1f and Supplementary Fig. 6c). Collectively, these data 
indicate that the endonuclease activity of Mili is absolutely required for 
LI silencing. 

We next analysed Mili-bound piRNAs to understand the impact of 
the Mili?“" mutation on the piRNA pathway. Equal quantities of 
piRNA were observed in Mili RNPs from Mili?“" fetal gonadocytes 
(Fig. 2a). piRNA libraries from biological duplicates of Mili RNPs from 
wild-type and Mili?*" fetal gonadocytes were subjected to deep 
sequencing. 20.3% of piRNAs are longer in Mili?“"' versus wild-type 
gonadocytes (Fig. 2b). Among repetitive element piRNAs a 4.2- and 
5.1-fold reduction in the fraction of L1 and IAP piRNAs, respectively, 
are observed in Mili?" libraries (Fig. 2c). When we mapped the reads 
to the genome, no differences were observed in the genomic origin 
between wild-type and Mili?" piRNAs (Supplementary Fig. 7), indi- 
cating that the endonuclease activity of Mili is not required for the 
selective expression of specific piRNA clusters. We next performed a 
qualitative analysis of L1 and IAP piRNAs. For this analysis, we only 
consider unique reads and not their individual depths within libraries. 
No differences in L1 and IAP piRNA formation were observed in the 
wild-type and Mili>*” libraries (Supplementary Fig. 8a). We therefore 
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The precise endonucleolytic cleavage site of an Argonaute protein is 
at the nucleotide opposed to the tenth nucleotide from the 5’ end of the 
guide small-RNA**’’”. A 5'U (1U) bias is a characteristic of primary 
piRNAs, therefore when a primary piRNA directs cleavage of a target 
followed by 3’ processing, the secondary piRNA will contain a bias for 
A at position 10 (refs 5, 6). In addition, there is an overlap of 10 
nucleotides; this precise complementarity is detected when comparing 
distances between 5’ ends of complementary piRNAs and is known as 
the ping-pong signature*®. A second round of piRNA biogenesis 
fuelled by the secondary piRNA results in the amplification of the 
initiating primary piRNA. Analysis of Mili-bound piRNAs, now tak- 
ing depth into consideration, revealed reduced levels of both L1 and 
IAP piRNAs in Mili”*” fetal gonadocytes (Fig. 2c, d). The increase in 
1U and a decrease in 10A-containing piRNAs in Mili?" libraries are 
indicative of the failure of piRNA amplification (Fig. 2e). Notably, the 
ping-pong signature is lost for both L1 and IAP piRNAs in Mili?4” 
mice (Fig. 2f). Collectively, these data are consistent with a function of 
the endonuclease activity of Mili in the initiation of secondary proces- 
sing required for transposon piRNA amplification. 

The lack of secondary processing in Mili?“ gonadocytes resulted 
in the stark reduction of piRNA in Miwi2 RNPs (Fig. 3a). The addi- 
tional cycles required to generate libraries from Mili”“" gonadocytes 
supported the observation of a significant decrease in piRNAs within 
Miwi2 from Mili?*” gonadocytes (Supplementary Table 2). Miwi2 
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Figure 3 | Marked reduction of Miwi2-bound piRNAs in Mili?4” mice. 

a, Miwi2 RNPs were immunoprecipitated from E16.5 fetal testis of the 
indicated genotypes and piRNA represented as in Fig. 2a. b, Confocal 
projection images of indirect immunofluorescence with Tdrd9, Dcpla and 
Miwi2 antibodies (green) and DAPI-stained DNA (red) from E16.5 fetal testis 
of the indicated genotypes. c, Annotation of cloned Miwi2-bound piRNAs as 


indicated from biological replicates of wild-type and Mili?“” E16.5 fetal 
gonadocytes. d, Mapping of the Miwi2-bound piRNAs to the consensus of L1 
(left) and IAP (right) elements. Positive and negative values indicate sense and 
antisense piRNAs, respectively. Schematic representations of the respective 
elements are also shown. 
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Depla revealed intact piP-body formation in Mili gonadocytes 
(Fig. 3b). Furthermore, Miwi2 shows cytoplasmic foci and, consistent 
with the partial piRNA loading, retained nuclear localization in 
Mili?“" gonadocytes albeit with a lower staining intensity (Fig. 3b). 
Sequencing revealed a piRNA population of a normal size profile 
featuring an approximately 2-fold decrease in the percentage of both 
LINE- and IAP-associated piRNAs (Fig. 3c and Supplementary Fig. 
9a). However, when mapped to L1 and IAP consensus sequences, the 
Miwi2-bound piRNAs from Mili?" mice showed a pattern of piRNA 
formation equivalent to wild type (Fig. 3d and Supplementary Fig. 8b). 
Whereas Miwi2 bound sense and antisense transposon piRNAs in 
almost equal measure in both wild-type and Mili?“" gonadocytes 
(Supplementary Table 2), as previously described*’, the amplitude of 
the antisense piRNA peaks derived across L1 was diminished com- 
pared to those previously observed*’. This difference probably reflects 
experimental variation due to the antibody used or recent advances in 
library generation/sequencing. No difference in the genomic origin of 


Therefore, the endonuclease activity of Mili does not alter the identity 
of piRNAs that reside within Miwi2 RNPs, merely their quantity. 
The fact that Miwi2 is seeded with the appropriate but severely 
reduced quantities of piRNAs in Mili?" mice indicates that the 
potential endonuclease activity of Miwi2 cannot compensate for the 
loss of Mili’s slicer activity. To understand the contribution of Miwi2’s 
putative endonuclease activity to piRNA amplification and transposon 
silencing, we generated Miwi2?7!94/P7184 (Mfiwi2?4") and Miwi2 ‘— 
mice (Supplementary Figs 10 and 11). Surprisingly, Miwi2?4" mice 
are fertile with no defects observed in testis morphology (Fig. 4a). 
Furthermore, in contrast to Miwi2~'~ fetal gonadocytes**, both L1 
and IAP are normally repressed (Fig. 4b, c). In addition to transposon 
silencing, Miwi2 is specifically required for the maintenance of 
spermatogonial stem cells’. No loss of germ cells is observed in 
Miwi2? aged mice (9-12months) (Supplementary Fig. 12). 
Therefore, the Miwi2?“" mutation does not have an impact on any 
known physiological function of Miwi2. Accordingly, both Mili and 


cluster-related piRNAs was observed (Supplementary Fig. 9b). Miwi2 are normally loaded with piRNA in Miwi2“" fetal gonadocytes 
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Figure 4 | Normal spermatogenesis and transposon silencing in Miwi2?4" 


mice. a, Haematoxylin- and eosin-stained wild-type, Miwi2?“" and Miwi2 ‘~ 
testis sections from 3-month-old mice. b, Immunofluorescence using anti-L1 
Orfl and anti-IAP Gag antibodies (green) and DAPI-stained DNA (blue) on 
wild-type, Miwi2?“" and Miwi2‘~ E16.5 fetal testis sections are shown. 

c, Methylation-sensitive Southern blot on Hpall-digested DNA extracted from 
P10 wild-type, Miwi2?4" and Miwi2 '~ testis using a L1 promoter probe is 
shown. The arrow indicates the identity of the methylation-sensitive fragment. 
d, Mili (left) and Miwi2 (right) RNPs were immunoprecipitated from E16.5 
fetal testis of the indicated genotypes shown as in Fig. 2a. e, Mapping of the 
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Mili-bound (left) and Miwi2-bound (right) piRNAs to the consensus of L1. 
Positive and negative values indicate sense and antisense piRNAs, respectively. 
Schematic representation of L1 is shown (above). f, Ping-pong analysis of Mili- 
and Miwi2-bound piRNAs from biological replicates of wild-type and 
Miwi2?4" £16.5 fetal gonadocytes are shown. The frequency of the distance 
between 5’ ends of complementary piRNAs for L1 is presented as in Fig. 2e. 
g, Model. L1 element silencing is dependent upon Mili’s endonuclease activity 
for piRNA amplification. IAP silencing is dependent upon Mili and Miwi2 but 
independent of piRNA amplification. The small blue and green lines represent 
sense and antisense piRNAs, respectively. 
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(Fig. 4d). No qualitative or quantitative differences in piRNA origin or 
amplification were observed in Mili- or Miwi2-bound piRNAs from 
Miwi2?" fetal gonadocytes (Fig. 4e, fand Supplementary Figs 13-15). 
The lack of physiological and molecular phenotype in Miwi2?“" mice 
argues against the possibility that Miwi2-mediated cleavage events 
functionally contribute to either slicing of nuclear transposon tran- 
scripts or piRNA amplification necessary for the establishment of 
transposon silencing. In support of the latter conclusion, we find 
piRNA amplification as judged by the existence of a robust ping-pong 
signature in Mili-bound piRNAs from Miwi2 ‘~ fetal gonadocytes 
(Supplementary Fig. 16). 

We propose a revised model of mammalian embryonic piRNA func- 
tion, whereby Mili’s endonuclease activity initiates secondary piRNA 
processing with an intra-Mili ping-pong cycle fuelling piRNA amp- 
lification (Fig. 4g). Mili’s endonuclease activity is specifically required 
for the expansion of L1 and IAP piRNAs within Mili RNPs and the 
normal accumulation of all classes of piRNAs within Miwi2 RNPs. The 
defective piRNA pathway in Mili?“ mice results in the failure to 
repress specifically Ll, revealing distinct silencing requirements for 
the respective transposons. The establishment of L1- but not IAP- 
silencing is strictly dependent upon Mili’s endonuclease activity and 
secondary piRNA biogenesis (Fig. 4g). The sufficiency of primary 
piRNA processing in Mili?“ mice to direct Miwi2-mediated IAP 
repression illustrates fundamental differences in piRNA dosage 
required for silencing of the respective transposons. This difference 
may reflect their genomic burden—L1 occupies approximately 19% 
of the mouse genome whereas IAPs account for only 0.2% (ref. 28). 
Therefore, amplification of the L1 piRNA pool may be essential to 
program sufficient Miwi2 to target the prolific L1 element. In conclu- 
sion, we show that a single Piwi endonuclease supports piRNA amp- 
lification and distinct transposon silencing. It remains to be seen if these 
are conserved features of piRNA-mediated transposon silencing. 


METHODS SUMMARY 

Histology. Testes were fixed in Bouin’s fixative overnight at 4°C temperature, 
paraffin embedded and sectioned at 6-8-|1m thickness. Sections were then stained 
(haematoxylin and eosin or periodic acid Schiff) by routine methods. 
Immunofluorescence. For immunofluorescence, E16.5 fetal testes were freshly 
embedded in OCT, 6 tm sections cut and fixed in 4% paraformaldehyde. For Mili 
co-localization with Mvh, Tdrd1 and Mov10L1, sections were boiled twice for 5 min 
in 10mM pH6 sodium citrate solution for antigen retrieval. Sections were blocked 
for 30 min at room temperature in 10% normal donkey serum, 2% BSA and 0.1 M 
glycine. Primary antibody incubation was done overnight at 4°C in the blocking 
buffer. Anti-rabbit Alexa-488-conjugated (1:1,000; Invitrogen) and anti-mouse 
Alexa-546-conjugated (1:1,000; Invitrogen) antibodies were used as secondary 
antibodies. DAPI (3 pg ul?) (Sigma) was used to stain DNA. For Miwi2, L1 
Orfl and IAP staining, paraformaldehyde-fixed sections were permeabilized with 
0.1% Triton X-100 for 10 min and then blocked in TBS-T 10% normal donkey 
serum and processed as above. A Leica TCS SP5 confocal microscope was used to 
acquire images. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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Mouse strains. Mili’ allele: the Piwi domain of Mili is encoded in exons 20 to 
23. Mili aspartic acid 813, the second aspartic acid of the catalytic triad, is encoded 
in exon 21. To generate the Mili?*!*4 allele we replaced wild-type exon 21 with a 
mutant exon where the aspartic acid 813 codon is mutated to encode an alanine. 
A targeting construct was recombineered that contains homology arms and a 
frt flanked neomycin (neo) cassette 3’ of exon 21 that contains the Mili?874 
mutation. Southern blotting of the individual ES-cell-derived clone genomic 
Sacl-digested DNA with a 3’ external probe was used to identify homologous 
recombinants. A 9.2-kb DNA fragment corresponds to the wild-type Mili locus; 
integration of the neo’-frt flanked cassette 3’ of exon 21 introduces an additional 
Sacl site, thus decreasing the size of the SacI DNA fragment recognized to 8.0 kb in 
the targeted allele. Flp-mediated recombination and excision of the neo’-frt 
flanked cassette results in a 6-kb Sacl DNA fragment recognized by the external 
3’ probe, which is diagnostic of the Mili?” allele. 

Miwi2?" allele: the Piwi domain of Miwi2 is encoded in exons 14-19. 
Miwi2 aspartic acid 761, the second aspartic acid of the catalytic triad, is 
encoded in exon 17. To generate the Miwi2?7°! allele we replaced wild-type 
exon 17 with a mutant exon where the aspartic acid 761 codon is mutated to 
encode an alanine. A targeting construct was recombineered that contains 
homology arms and a frt flanked neomycin (neo) cassette 3’ of exon 17 that 
contains the Miwi2””°'* mutation. Southern blotting of the individual ES-cell- 
derived clone genomic BsrGI-digested DNA with an external 5’ probe was 
used to identify homologous recombinants. An 11.8-kb DNA fragment corre- 
sponds to the wild-type Miwi2 locus; integration of the neo’-frt flanked cassette 
3’ of exon 17 introduces an additional BsrGI site, thus decreasing the size of 
the BsrGI DNA fragment recognized to 8.4kb. Flp-mediated recombination 
and excision of the neo’-frt flanked cassette results in a 6.4-kb BsrG] DNA 
fragment recognized by the external 3’ probe, which is diagnostic of the 
Miwi2?"" allele. 

Miwi2 null allele: for the Miwi2~ allele, we flanked exon 17 with loxP sites. 
Cre-mediated deletion of exon 17 results in out-of-frame splicing between exon 
16 and 18, resulting in stop codons before the last exon, targeting the mutant 
transcript for nonsense-mediated decay”. Should a fraction of the mutant tran- 
script escape nonsense-mediated decay, a truncated protein would be made that 
lacks most of the Piwi domain and thus would probably be non-functional. To 
generate this allele, a targeting construct was generated that contains the same 
homology arms as the MiliP®!*4 construct, an frt flanked neo cassette with a loxP 
site 3’ of exon 17 anda second 5’ loxP site. Southern blotting of the individual ES- 
cell-derived clone genomic KpnI-digested DNA with an external 5’ probe was 
used to identify homologous recombinants. A 10.1-kb DNA fragment corre- 
sponds to the wild-type Miwi2 locus; integration of the neo’-frt2-loxP cassette 
3’ of exon 17 introduces an additional KpnI site, thus decreasing the size of the 
KpnI DNA fragment recognized to 8.0kb. Cre-mediated recombination and 
excision of exon 17 and neo” loxP flanked cassette results in a 5.3-kb KpnI 
DNA fragment recognized by the external 3’ probe, which is diagnostic of the 
Miwi2 null (Miwi2_ ) allele. 

The Miwi2 targeting constructs and the Mili targeting construct were electro- 
porated into IB10° and A9 ES cells, respectively. A9 ES cells are derived from 
hybrid embryos resulting from a 129/Sv male by C57BL/6 female cross (A. Wutz, 
manuscript in preparation). Southern blotting as described above of the individual 
ES-cell-derived clones was used to identify homologous recombinants. IB10- 
targeted ES cells were used to generate chimaeras for the respective Miwi2 targeted 
alleles by standard blastocyst injections. A9-targeted ES cells were injected into 
C57BL/6 8-cell-stage embryos for the generation of fully ES-cell-derived mice 
following a procedure similar to one described previously*’ but using a Piezo 
Impact Unit (PMM150FU, Prime Tech) rather than a laser to puncture the zona 
pellucida of the host embryo. The Mili?“ and Miwi2“"-targeted mice were then 
crossed to the FLP-expressing transgenic mice (FLPeR)* to remove the frt flanked 
neo’ cassette, resulting in the generation of Mili?“# and Miwi2?4# alleles, respec- 
tively. Mice heterozygous for the loxP flanked Miwi2-targeted allele were crossed 
to Deleter Cre*’ to generate the Miwi2 null (Miwi2_ ) allele. The mice analysed in 
this study were on a mixed C57BL/6 and 129 genetic background. 

All of the mice were bred and maintained in EMBL Mouse Biology Unit, 
Monterotondo in accordance with current Italian legislation (Art. 9, 27 January 
1992, number 116) under license from the Italian health ministry. 

Antibodies. Rabbit polyclonal antibodies against mouse Miwi2 were generated 
using the same epitope as described previously* and used for immunoprecipita- 
tion and immunofluorescence (1:200) experiments. A mouse monoclonal anti- 
body against Mili was obtained from R. Pillai and used for immunoprecipitation 
and immunofluorescence (1:1,000) experiments. The following antibodies were 
used at the indicated dilutions for immunofluorescence: anti-Orfl L1 (S. Martin; 
1:250), anti-[AP Gag (B. Cullen; 1:500), anti-Mvh (Abcam (ab13840); 1:200), 


anti-Mov10L1 (R Pillai; 1:200), anti-Tdrd1 (R. Pillai; 1:200), anti-Tdrd9 (S. 
Chuma; 1:200) and anti-Dcpla (J. Lykke-Andersen; 1:500). 
Histology. Testes were fixed in Bouin’s fixative overnight at 4°C temperature, 
paraffin embedded and sectioned at 6-8-|1m thickness. Sections were then stained 
(haematoxylin and eosin or periodic acid Schiff) by routine methods. 
Immunofluorescence. For immunofluorescence, E16.5 fetal testes were freshly 
embedded in OCT, 6 um sections cut and fixed in 4% paraformaldehyde. For Mili 
co-localization with Mvh, Tdrd1 and Mov10I, sections were boiled twice for 5 min 
in 10mM pH 6 sodium citrate solution for antigen retrieval. Sections were blocked 
for 30 min at room temperature in 10% normal donkey serum, 2% BSA and 0.1 M 
glycine. Primary antibody incubation was done overnight at 4 °C in the blocking 
buffer. Anti-rabbit Alexa-488-conjugated (1:,1000; Invitrogen) and anti-mouse 
Alexa-546-conjugated (1:1,000; Invitrogen) antibodies were used as secondary 
antibodies. DAPI (3 pg ul?) (Sigma) was used to stain DNA. For Miwi2, L1 
Orfl and IAP staining, paraformaldehyde-fixed sections were permeabilized with 
0.1% Triton X-100 for 10 min and then blocked in TBS-T 10% normal donkey 
serum and processed as above. 

A Leica TCS SP5 confocal microscope was used to acquire images. 
Detection of apoptotic cells. Detection of apoptotic cells was performed on 
paraformaldehyde-fixed paraffin-embedded testis section using the in situ cell 
death detection kit (Roche) and developed with DAB substrate (Roche). 
Sections were also stained with haematoxylin. 
Electron microscopy. Fetal testes were fixed in 2.5% glutaraldehyde in 50 mM 
cacodylate buffer supplemented with 2% sucrose, 50 mM KCl, 2.6 mM CaCl, and 
2.6mM MgCl, for 30 min at 4°C and rinsed in 50 mM cacodylate buffer. Samples 
were incubated in 2% osmium in 50 mM cacodylate buffer for 40 min on ice, rinsed 
in water and incubated in 0.5% uranylacetate in water for 30 min on ice. The contrast 
enhancement procedure was followed by a stepwise dehydration in ethanol, up to 
100% ethanol and infiltration in EPON (Roth) for embedding. Polymerization was 
done at 60 °C. Ultrathin sections (60-70-nm thickness) of the testis were obtained 
using an ultramicrotome (Leica Microsystems) and the sections were mounted on 
formvar-coated slot grids, and contrasted with uranylacetate and lead citrate. The 
sections were then viewed in a CM120 biotwin electron microscope (FEI) operating 
at 100 kV. Digital acquisitions were made with a Keen View CCD camera (Soft 
Imaging System). 
Germ cell isolation. The previously described Oct4-GFP (ref. 34) allele was 
crossed into Mili?*"" and Miwi2?4" mice to label germ cells with eGFP. To isolate 
E16.5 and postnatal day 7 germ cells, single cell suspensions of testis were obtained 
by two-step enzymatic digestion and GFP-positive FACS sorted. 
CpG methylation analysis. Methylation-sensitive Southern blotting and bisulphite 
methylation analysis were performed as described’. 
Mass spectrometry. Mili was immunoprecipitated as described’, resolved on gel 
and the entire lane with the exception of the immunoglobulin fragments was 
subjected to liquid chromatography coupled to tandem mass spectrometry (LC- 
MSMS) on a LTQ Orbitrap Velos (Thermo Fisher Scientific) instrument. 
RT-qPCR analysis. Total RNA was isolated from 10 day post-partum testis using 
Trizol according to the manufacturer’s instruction and treated with Turbo DNase 
RNase-free (Ambion). cDNA synthesis was performed with SuperScript III 
Reverse Transcriptase (Invitrogen) with Random Hexamers (Invitrogen). 
Quantitative PCR was carried out by using SYBR Green I Master mix (Roche) 
on LightCycler 480 system (Roche). Three animals for each genotype were 
examined and assays were always done in triplicate. Primers for qPCR were used 
as described’. 
Small RNA library generation. Mili and Miwi2 RNPs were immunoprecipitated 
as described'’. Small RNA libraries were generated as described previously*’ but 
using adaptors suitable for sequencing on the Ilumina platform. 
Data analysis. Sequencing data were processed from the FASTQ format and 
analysed using R/Bioconductor*®. Barcode sequences were resolved by sample 
with no mismatches allowed, 5'/3' adaptors were stripped, sequences were filtered 
for low-complexity regions and finally size-selected for reads between 24-30 nt. 
Processed reads for each sample were mapped against the mouse genome 
(NCBIm37) using Bowtie 0.12.5 (ref. 37) allowing for two mismatches and 
requesting all matching sites. For reads mapping to multiple distinct loci, only 
the first 100 were reported according to match score. Mapped reads were categorized 
according to genomic annotations from Ensembl** Mouse v58 (LTR, LINE, SINE, 
genic, non-coding RNA). Reads not mapping to any recorded genomic element 
were classed as ‘other’. Counts for reads mapping to multiple loci were divided by the 
total number of loci. In the specific cases of LINE and IAP elements, all reads were 
also mapped against representative canonical sequences obtained from GenBank” 
(M13002.1, EU183301.1) using Bowtie allowing for up to three mismatches. The 
number of repeat mapping reads was divided by repeat length and the number of 
genome mapping reads per billion processed reads. Genome-wide visualizations of 
read mapping to the mouse genome were obtained using Circos”’ plots and were 
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scaled between each set of four samples according to library size (see figure legend). 
Line-track y-axis maxima are set to one-tenth of the highest peak from the library 
with most reads. Heat-map minima were set to one-half the average read count 
across all bins and maxima to one-half the maximum read count across all bins. 
Heat-map scaling across colour-space was performed using the ‘scale_log_base = 5° 
parameter of Circos to sample colour space better. 

For ping-pong analysis, only reads mapping to repeat elements were considered. 
For each pair of sense/antisense overlapping reads, the distance between their 5’ 
ends was recorded and counts were represented as relative frequencies within 
samples for each repeat element. 
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Miwi catalysis is required for piRNA amplification- 
independent LINE] transposon silencing 


Michael Reuter”, Philipp Berninger’’, Shinichiro Chuma**, Hardik Shah°, Mihoko Hosokawa**, Charlotta Funaya’®, 


Claude Antony®, Ravi Sachidanandam? & Ramesh S. Pillai? 


Repetitive-element-derived Piwi-interacting RNAs (piRNAs)’? act 
together with Piwi proteins Mili (also known as Piwil2) and Miwi2 
(also known as Piwil4) in a genome defence mechanism that initi- 
ates transposon silencing via DNA methylation in the mouse male 
embryonic germ line. This silencing depends on the participation 
of the Piwi proteins in a slicer-dependent piRNA amplification 
pathway and is essential for male fertility**. A third Piwi family 
member, Miwi (also known as Piwill), is expressed in specific 
postnatal germ cells and associates with a unique set of piRNAs 
of unknown function*®’. Here we show that Miwi is a small RNA- 
guided RNase (slicer) that requires extensive complementarity for 
target cleavage in vitro. Disruption of its catalytic activity in mice 
bya single point mutation causes male infertility, and mutant germ 
cells show increased accumulation of LINE1 retrotransposon tran- 
scripts. We provide evidence for Miwi slicer activity directly cleav- 
ing transposon messenger RNAs, offering an explanation for the 
continued maintenance of repeat-derived piRNAs long after trans- 
poson silencing is established in germline stem cells. Furthermore, 
our study supports a slicer-dependent silencing mechanism that 
functions without piRNA amplification. Thus, Piwi proteins seem 
to act in a two-pronged mammalian transposon silencing strategy: 
one promotes transcriptional repression in the embryo, the other 
reinforces silencing at the post-transcriptional level after birth. 

To investigate the biochemical role of Miwi, we affinity-purified 
Miwi ribonucleoprotein (RNP) complexes from mouse testes. They 
contained Miwi, its interaction partners*” (Supplementary Table 1), 
and an abundant species of ~30-nucleotide (nt) small RNAs*® (Fig. la 
and Supplementary Fig. la-c). Deep sequencing of Miwi-associated 
piRNAs from developmentally staged mouse testes revealed that Miwi 
is primed with a largely similar population of piRNAs in meiotic and 
post-meiotic germ cells (Supplementary Fig. 1d—g and Supplementary 
Table 2). 

Some members of the Argonaute family, including Piwi proteins 
are small RNA-guided nucleases (slicers) that catalyse cleavage of 
target nucleic acids’’. The slicer activity depends on the presence of a 
catalytic motif (Asp-Asp-His; DDH motif)'*"* in the Piwi domain of 
Argonautes, which is also conserved in Miwi. To biochemically con- 
firm this activity, we incubated’ Miwi complexes with an RNA target 
bearing perfect complementarity to one Miwi-associated piRNA 
(piR-A). The target was cleaved (Fig. 1b) at a position 10-nt down- 
stream of the 5’ end of the guide piRNA, similar to that produced by the 
established slicer, human AGO2 (ref. 13; Supplementary Fig. 2d). The 
addition of EDTA abolished the endonucleolytic cleavage, proving the 
Mg** dependency of the reaction'*’* (Fig. 1b). DNA was not recog- 
nized as a substrate (Supplementary Fig. 2c). These results confirm that 
Miwi is a bona fide slicer. 

In agreement with crystal structures'* of Argonaute catalytic com- 
plexes, mismatch between the first position of the piRNA and the 
target did not affect cleavage activity (Fig. 1c). Although mismatches 


10,11 
> 


at several 3'-terminal nucleotides were tolerated, base pairing of 
nucleotides 2-22 was required for efficient target cleavage by Miwi 
(Fig. 1c, d). Similar assays with a second target (for guide piR-E) gave 
consistent outcomes, but highlighted the need for increased com- 
plementarity to the target (Supplementary Fig. 2f). Hence, we conclude 
that Miwi-associated piRNAs guide endonucleolytic cleavage of target 
RNAs with extensive complementarity. 

To determine the role of Miwi catalytic activity in vivo, we generated 
mice (Miwi*?") with a point mutation in Miwi at the first aspartate 
(D633A) of the catalytic motif (Supplementary Fig. 3), a mutation 
known to disrupt the activity in other slicers'™"*!7"*. The Miwi*?" 
mutation was dominant negative, as Miwi +/ADH male mice were sterile. 
Female fertility was unaffected by this mutation. As expected, 
Miwi'’“?" testes extracts had slightly reduced slicer activity compared 


to the wild type (Fig. 2a). Spermatogenesis in Miwi*/“?" mice closely 


resembled that of the control mice (Miwi*’) until spermiogenesis!” 
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Figure 1 | Miwi is a slicer requiring extensive complementarity for target 
cleavage. a, Coomassie-stained immunopurified (BVI antibody) Miwi 
complexes and 5’-end-labelled associated small RNAs (piRNAs). IP, 
immunoprecipitates. HC and LC are antibody heavy and light chains, 
respectively. b, Slicer assays with Miwi complexes (purified with BVI or BTO 
antibodies) or beads alone (contol) and a 5’-end-labelled RNA target bearing 
complementarity to endogenous piR-A (detected by northern blotting, 
bottom). Addition of EDTA abolishes Miwi slicer activity. Arrow indicates the 
5' cleavage product. c, d, Slicer assays with target RNAs carrying mismatches at 
indicated positions relative to the guide piRNA. Protein molecular weight 
markers in kDa and RNA markers in nucleotides (nt) are shown. 
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Figure 2 | Miwi catalytic activity is essential for spermatogenesis. a, Slicer 
assays with Miwi complexes from the Miwi'/4?” mutant. Total piRNAs are 
5'-end labelled, whereas specific piRNAs are detected by northern blotting. 
b, Haematoxylin and eosin-stained testis sections showing the spermatogenic 
arrest in different indicated Miwi genotypes. ES, elongating spermatids; RS, 
round spermatids; PS, pachytene spermatocyes. ¢, Slicer assay with Miwi*?P# 
complex from the Miwi ‘4? mutant. d, Electron micrographs showing 
morphologically aberrant elongating spermatids in the Miwi*’“?" mutant. 

e, Electron micrographs of round spermatids showing the chromatoid body 
(CB) or its absence (residual density marked with a bracket) in Miwi mutants. 
N, nucleus. 


(Fig. 2b and Supplementary Fig. 4a). Elongating spermatids are seen in 
Miwi*/4?™ seminiferous tubules, but they exhibit abnormal nuclear 
morphologies (Fig. 2d) and mutant sperm were degenerated in 
the epididymis (Supplementary Fig. 5b). Timing and levels of 
chromatin-compacting protamine expression is essential for spermio- 
genesis”. Immunostaining of Miwi’/“?" mutant testis sections 
revealed an altered pattern for protamine 2 (Prm2), with staining 
detected around groups of aggregated elongating spermatid nuclei 
(Supplementary Fig. 6). However, the present state of our analyses does 
not allow precise determination of the molecular reason for the sterility 
observed in the Miwi*/4?" mutant. 

To obtain males exclusively expressing the Miwi*?"' protein, we 
generated Miwi “P# mice (hereafter referred to as the catalytic 
mutant). Miwi slicer activity was completely abolished in the catalytic 
mutant (Fig. 2c and Supplementary Fig. 7b). This loss of catalytic activity 
was without any change in protein abundance, sub-cellular localiza- 
tion, arginine methylation or the piRNA-binding capacity of Miwi 
(Supplementary Figs 7 and 8). Similar to the Miwi ‘~ knockout, 
catalytic mutant males were sterile, with spermatogenesis being 
arrested after meiosis at haploid round spermatids (Fig. 2b and 


LETTER 


Supplementary Fig. 4a). Both the mutants showed defective organiza- 
tion of the ribonucleoprotein (RNP) granule called the chromatoid 
body”’ in their arrested round spermatids (Fig. 2e and Supplemen- 
tary Fig. 8a). The loss of chromatoid body integrity in the catalytic 
mutant was without any changes in the sub-cellular localization 
of several Miwi-interacting proteins* (Supplementary Fig. 8e). 
Furthermore, the gene expression profiles of round spermatids were 
highly similar between the knockout and catalytic mutants (see later 
and Fig. 3d). These observations allowed us to conclude that most 
functions of Miwi are a result of its slicer activity. 
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Figure 3 | Primary piRNA biogenesis is globally unaffected in the Miwi*?” 
mutants and mRNAs are not targeted by piRNAs. a, Ethidium bromide 
staining of total small RNAs. b, Miwi-associated piRNAs detected by 5'-end 
labelling from indicated Miwi genotypes. c, Normalized density of Miwi piRNA 
reads over a genomic cluster. d, MA plots showing comparison of gene 
expression profiles in purified round spermatids from indicated Miwi 
genotypes. Significantly (P < 0.001) changed genes are highlighted in red. The 
top 100 genes having either exonic sense (red) or exonic antisense (green) 
piRNAs are highlighted (bottom right). e, Northern blot analyses of select 
(circled in d) late spermatogenesis genes and loading control (rRNA staining 
with ethidium bromide) are shown. 
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On the basis of their high preference for a uridine at the first 
nucleotide position (U1 bias) and their origin from the same strand 
within a genomic cluster, Miwi piRNAs are believed to be generated 
by the primary piRNA biogenesis pathway”. Sequencing of Miwi- 
associated piRNAs® from the Miwi*“?" and Miwi 4?" mutants 
revealed unchanged genome annotation profiles and piRNA cluster 
mappings, consistent with the view that primary piRNA biogenesis is 
independent of the slicer activity of Piwi proteins’ (Fig. 3a-c and 
Supplementary Fig. 9 and Supplementary Table 2). Nevertheless, we 
note that the abundance of piR-A was affected in the Miwi 4?" 
mutant (Supplementary Fig. 7c, d), probably owing to the develop- 
mental block’. 

The slicer activity of Miwi is reminiscent of the classical RNA inter- 
ference (RNAi) action on target RNAs, which drives their rapid 
decay’’. To study the effect of Miwi slicer activity on gene expression, 
we examined mRNA levels in purified round spermatids using micro- 
arrays (Fig. 3d and Supplementary Fig. 5c). When compared to the 
control (Miwi*’~), several genes were affected in the knockout and 
catalytic mutants, some of which we confirmed by northern blot ana- 
lyses (Fig. 3e). Importantly, the differences in gene expression profiles 
are not a result of mRNA transcripts being targeted by piRNAs (Fig. 3d 
and Supplementary Table 3). 

To examine whether other cellular RNAs could be targeted by Miwi 
piRNAs, we analysed RNA-seq reads from purified round spermatids 
of the control (Miwi*’), and the Miwi*/4?" and Miwi 4?" mutants 
(Supplementary Fig. 10). Repeat-derived piRNAs constitute ~24% of 
Miwi-associated sequences (Supplementary Fig. 1d), and could 
potentially guide Miwi slicer activity to transposon transcripts found 
in post-meiotic germ cells. Indeed, we observed a strong increase of 
LINE-specific reads in the catalytic mutant RNA-seq library (Fig. 4a 
and Supplementary Fig. 10c). This was quantified by quantitative 
polymerase chain reaction with reverse transcription (qRT-PCR) for 
LINE] (L1) elements to be 7- to 22-fold higher in total RNA from 
Miwi ‘“P" round spermatid preparations (Fig. 4b). Western blotting 
and immunofluorescence detection of LLORF1p”, a protein product 
expressed from active L1 elements, in the catalytic mutant round 
spermatids substantiated L1 upregulation (Fig. 4c, d and Supplemen- 
tary Fig. 1la-c). A similar pattern of L1 upregulation was seen in the 
Miwi “knockout mutant (Fig. 4b-d). Phosphorylated histone H2AX 
(y-H2AX) normally marks sites of double-stranded DNA breaks in 
early meiotic cells, and the XY chromosomes in pachytene spermatocytes. 
Coincident with LIORF 1p detection, we observed increased staining 
for y-H2AX in the mutant round spermatids (Supplementary Fig. 11). 
This is similar to the upregulation of y-H2AX observed in radiation- 
damaged round spermatids”. Therefore, it is clear that the resultant 
sterility phenotype of Miwi knockout and catalytic mutants is 
multifactorial, extending beyond the reported roles of Miwi in trans- 
lation control’* and stabilization of certain mRNAs’. 

Slicer activity generates a 5'-monophosphate-containing cleavage 
fragment that is complementary to the small RNA guide by 10 nt, across 
their 5’ ends'*'®. We searched a global 5’-RACE (rapid amplification of 
5’ complementary DNA ends) library from control (Miwi*’~) round 
spermatids and detected an enrichment of genomic sites with 5’-RACE 
tags that overlap Miwi piRNAs by 10 nt (Fig. 4e and Supplementary 
Fig. 12). Several such overlapping RACE tags derive from LINE 
elements, consistent with Miwi-mediated processing of these retro- 
transposon mRNAs (Fig. 4e). By contrast, in the catalytic mutant, 
genomic sites with RACE tags overlapping across the full length of 
Miwi piRNAs are more abundant (peak at 29 nt; Fig. 4e), revealing the 
presence of uncleaved target RNAs, including those from LINE 
elements. The signature for slicer activity detected in the catalytic 
mutant is probably due to Mili”*, which is another potential slicer 
loaded with similar guides (Supplementary Fig. 13). 

L1 silencing in embryonic** and pre-meiotic*® male germ cells is 
linked to a slicer-dependent piRNA amplification pathway, a signature 
of which is the presence of a 10-bp overlap between the 5’ ends of 
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Figure 4 | Miwi piRNAs target L1 transposons. a, Scatter plot showing fold 
change of repeat reads in the RNA-seq library from the Miwi ““?" round 
spermatids compared to control (+/—). LINE-specific reads are indicated in 
red and upregulated LINEs are circled. b, Quantitative RT-PCR analysis of L1 
transcripts in whole testis and purified round spermatids. Each bar represents 
individual biological replicate and standard deviation (s.d.) between technical 
replicates within a biological sample is indicated. c, d, Detection of LIORF1p in 
purified round spermatids by western blotting (c) and by immunofluorescence 
analysis (d) of testis sections. e, Genome-wide 5'-end overlap analysis of global 
5'-RACE tags and Miwi piRNAs (top). Similar overlaps over a specific L1 
sequence (bottom). f, 5’-end overlap analysis of Miwi piRNAs. The s.d. between 
three independent biological replicates is indicated. 


piRNAs. We find that the Miwi*?" mutation had no significant 
impact on a weak amplification signature detected in Miwi piRNA 
populations’ (Fig. 4f). There was also no global change in the presence 
of repeat-derived piRNA reads or in DNA methylation of L1 elements 
in the catalytic mutant (Supplementary Table 4 and Supplementary 
Fig. 14). Thus, the post-transcriptional mechanism we describe here 
probably reinforces silencing of transposons that escape repression 
during spermatogenesis. 

Our data indicate that a substantial portion of the biology of Miwi is 
implemented via its slicer activity, which in turn implies a role for its 
associated piRNAs. Here, we identified a role for the repeat component 
of Miwi piRNAs, but those deriving from unannotated genomic 
regions still remain a mystery. These piRNAs are not complementary 
to any other parts of the genome, thus the only possible targets are 
RNAs arising from the opposite strands of the piRNA clusters. 
However, we did not observe any significant upregulation of tran- 


scripts from the opposite cluster strands in the Miwi*?" mutants. It 
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remains a possibility that slicer cleavage by Miwi may not always lead 
to RNA decay but could serve as a signal for processes involved in late 
spermatogenesis. 


METHODS SUMMARY 


Immunoprecipitations were carried out using anti-Miwi antibodies (BVI and 
BTO) from mouse testes extracts. Slicer assays'> were carried out using purified 
Miwi complexes and 5'-end-labelled target RNAs complementary to endogenous 


piRNAs. The catalytic mutant mouse carries a mutation of the endogenous Miwi 


locus’ converting the first aspartate of the DDH motif to alanine (Miwi?4), 


RNAs were deep sequenced using the Illumina platform. Total RNA from purified 
round spermatids was prepared using Trizol and analysed by gene microarrays 
and deep sequencing. A detailed description can be found in Methods. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Antibodies. For the production of anti-Miwi polyclonal antibodies, a GST fusion 
of the first 200 amino acids of Miwi was produced in Escherichia coli strain BL21 
and injected into two rabbits (BVI and BTO) (Laboratory Animal Resource (LAR) 
EMBL Heidelberg). The BVI antibody was previously characterized”*. The BTO 
antibody shows similar specificity to BVI in immunoprecipitations as judged by 
Coomassie staining of Miwi complex proteins (Supplementary Fig. 1b), size of 
small RNAs (Supplementary Fig. 1c), small RNA sequencing (data not shown), 
and by western blotting (Supplementary Fig. 7a for BVI and Fig. 4c for BTO). Both 
antibodies were used in immunofluorescence experiments (BVI as reported”* and 
Supplementary Fig. 8b for BTO) and gave similar results. 

The anti-Mili monoclonal antibody (13E-3)**, anti-Tdrdl (refs 28, 29), 

anti-Tdrd6 (ref. 30), anti-Tdrd7 (ref. 30), anti-L1 ORF1p rabbit polyclonal” 
(S. L. Martin), anti-Sm Y12 mouse monoclonal recognizing symmetrical dimethyl 
arginines*’ (D. Schuemperli) are described. Rabbit polyclonal antibodies to transi- 
tion proteins Tnp1, Tnp2 (ref. 32) (W. S. Kistler), and protamines Prm1 and Prm2 
(ref. 33) (R. Balhorn) are also described. Additional antibodies were purchased: 
anti-B-tubulin (Abcam), anti-y-H2AX (Upstate/Millipore), anti-Mili (Abcam), 
anti-Miwi (Cell signaling technology) and anti-Mvh (Abcam). 
Miwi complex purification. A single testis was dounce-homogenized in 1 ml 
extract buffer (10% glycerol, 50mM Tris/HCl pH 8.0, 150mM NaCl, 5mM 
MgCh, 0.5% Na deoxycholate, 1% Triton X-100, 1 protease inhibitors (Roche), 
50 ug ml 'tRNA and 1X vanadyl ribonucleoside complex (Sigma)). The extract was 
centrifuged twice at 9,200g, 4 °C, 15 min. Anti-Miwi or -Mili antibodies were bound 
to Protein G Sepharose overnight with gentle rocking at 4°C in IP buffer (10 mM 
Tris-HCl pH 8.0, 150 mM NaCl, 1 mM MgC],). After washes in IP buffer, extract was 
added and incubation continued for 2-5h at 4°C. After washes, complexes were 
used in slicer assays. Bound piRNAs were extracted by proteinase K treatment, 
phenol-choloroform extraction and further visualized by 5’-end labelling. 

For Miwi complex purification (Fig. la and Supplementary Fig. 1b), affinity 
purified anti-Miwi antibodies (25-100 ug BVI or BTO) were chemically cross- 
linked to protein G sepharose using dimethyl pimelimidate (DMP; Sigma 
D-8388). For proteomic analysis the beads-bound material was eluted with 
0.1 M glycin pH 2.5-3.0, 150 mM NaCl, and after methanol precipitation, separated 
by SDS-PAGE. 

After staining with Coomassie blue, all visible bands and intervening gel spaces 

were excised for identification by liquid chromatography-mass spectrometry 
(LC/MS/MS) (Proteomics Core Facility, EMBL Heidelberg). Briefly, after tryptic 
digest, peptides were separated by liquid chromatography using the nanoAcquity 
UPLC system (Waters) fitted with a nanoAcquity Symmetry C,, trapping and 
analytical column. Peptides were directly fed into a LTQ Orbitrap Velos (Thermo 
Fischer Scientific). Software MaxQuant was used for filtering the data and its 
preparation for MASCOT search. An unedited table with all identified proteins 
and the number of peptides determined is provided (Supplementary Table 1). 
Specificity of the identified proteins was determined by control purifications with 
protein G sepharose beads alone or protein G coupled to rabbit IgG (anti-HA). 
Some of the identified proteins, such as tudor proteins (Tdrd1 and Tdrd6)*” and 
kinesin motor protein Kif17 (ref. 34) were already identified in previous proteomic 
and biochemical studies to interact with Miwi, whereas the presence of others, 
such as Mili, in this list might be due to their co-localization in the chromatoid 
body**". 
Slicer assays. A synthetic 40-nt RNA (for piR-A) or in vitro transcribed RNAs (for 
piR-A and piR-E), both bearing complementarity to endogenous piRNAs, were 
used as targets (Supplementary Fig. 2). Target RNAs were gel eluted from 15% 
urea-polyacrylamid gels and 5’-end labelled. 

Miwi was immunopurified from testis extract (1 testis per ml buffer, ~0.5 pg 
anti-Miwi antibody bound to ~10 pl protein G sepharose). After the complete 
removal of wash buffer, beads were pre-incubated for >10 min in 10,1 slicer 
buffer’. (10 mM Tris/HCl pH 7.5, 100mM KCl, 2mM MgCh, 40 U RiboLock 
(Fermentas)) on ice. The assay was started by adding 10, 5’-labelled 
(10-50 fmol), denatured RNA (55 °C for 5-10 min followed by immediate transfer 
to ice). The reaction was performed in a thermoshaker (Eppendorf) for 0.5-2 h at 
35°C with constant changing modes of shaking (1,000r.p.m., 10s) and pause 
(90s). The reaction was stopped by proteinase K treatment followed by phenol- 
chloroform extraction and precipitation. RNA was resolved by 15% urea~PAGE. 
Gels were dried and exposed to Storage Phosphor screens (GE Health). 

We tested 14 different piRNAs by northern blotting to judge abundance and to 
identify suitable guides for slicer assays (data not shown). Of these only piR-A and 
piR-E guides gave detectable signals in the slicer assays reported here. So it remains 
a possibility that not all Miwi piRNPs are competent for slicer activity. 
Northern blot. Northern analysis of small RNAs was performed on 2.5-10 pg 
total RNA or Miwi-associated small RNAs separated by 15% urea~PAGE. When 
required (Supplementary Fig. 7c), RNAs were first treated by beta-elimination®” 


before PAGE separation. Small RNAs were detected with the following 5’ -labelled 
DNA probes: piRNA-A, AAAGCTATCTGAGCACCTGTGTTCATGTCA; 
piRNA-E, ACCAACTCTGTCTTCAGAATATACATCAACT; and miRNA-16, 
GCCAATATTTACGTGCTGCTA. Probes for the detection of longer transcripts 
were generated from DNA fragments or PCR products by random priming 
(Roche). 

Miwi*?” and Miwi null mice. The catalytic triad of Argonaute proteins is con- 
stituted by three amino acids (DDH motif)'*"*”°. The first aspartate of this motif in 
Miwi is amino acid number 633, coded in exon 17. We mutated D633 to alanine 
(D633A) by gene targeting in BRUCE4 embryonic stem (ES) cells’. We replaced 
the exons 16 and 17 with a fragment containing exon 16 and a modified exon 17 
with an alteration in a single codon that generates the ADH (D633A) point 
mutation. An frt-flanked neomycin (Neo) resistance cassette and a loxP site, along 
with an EcoRI site were placed 3’ of Miwi’ exon 17. Another loxP site was placed 5’ 
of exon 16. These modifications to the Miwi locus are termed the Miwi targeted 
(Miwi‘) allele (Supplementary Fig. 3). Southern blotting with two independent 
Miwi probes and EcoRI-digested genomic DNA from different ES cell clones 
confirmed proper integration into the targeted locus (data not shown). The 
modified ES cells were used by standard techniques to generate Miwi*“ mice 
(Transgenic Mouse Facility, EMBL Monterotondo) and Southern blotting 
confirmed the existence of the recombined loci in targeted mice (Supplemen- 
tary Fig. 3). Presence of a single 24.6-kb fragment with both probes identifies 
the wild-type allele. Owing to the integration of an additional EcoRI site, the 5’ 
probe identifies 12.6 kb whereas the 3’ probe identifies 14-kb fragments from the 
targeted allele. 

Excision of frt-flanked neomycin resistance cassette to generate the Miwi*?” 
knock-in allele was achieved by crossing Miwi'“ mice with the transgenic 
FLPeR* mice that ubiquitously express the Flp recombinase. To generate the 
Miwi null (Miwi /~) allele, Miwi*” mice were crossed with Deleter Cre*’ mice that 
ubiquitously express the Cre recombinase, which excised the neomycin resistance 
cassette and exons 16 and 17 that were flanked by the inserted loxP sites. It most likely 
results in a transcript containing a premature stop codon, leading to mRNA decay via 
NMD”. We did not detect any Miwi protein or truncated versions in our Miwi /~ 
mice by western blotting (Supplementary Fig. 7a). All mice were maintained in the 
C57/BL6 genetic background. 

Females carrying the Miwi null and ADH alleles in all combinations are fertile. 
We intercrossed Miwi/~ males with Miwi'/4>"' females to generate Miwi 4?" 
mice, which are referred to as catalytic mutant. PCR amplification of the relevant 
DNA region from the different genotypes identified the presence of the correct 
GAC (D633) to GCC (D633A) mutation in exon 17 in the Miwi*?" mutants 
(Supplementary Fig. 3c). All of the mice were bred and maintained in the 
EMBL Mouse Biology Unit, Monterotondo in accordance with current Italian 
legislation under license from the Italian health ministry. 

Immunofluorescence analysis. Immunofluorescence analysis”* was performed on 
paraffin or OCT (Tissue-Tek Cryo-OCT compound, Fischer Scientific) embedded 
testes sections. Primary antibodies (1:50-1:500) and Alexa-488- or 555-coupled 
secondary antibodies (Invitrogen) were diluted (1:5,000) before use. DNA was 
stained with Hoechst 33258 and slides were mounted with Vectashield (Vector 
Labs). PNA rhodamine (Vector Labs) was purchased. Anti-y-H2AX (Upstate) and 
mouse monoclonal y-H2AX peptide (phospho-S139, Abcam) were purchased. The 
antibody (11g) and peptide (10 times the antibody weight) was pre-incubated 
overnight at 4°C, after which the immunostaining procedure continued as usual. 
Samples were examined by fluorescence microscopy (Olympus BX61) and images 
captured using CCD (Olympus DP70). Miwi localization in the different genotypes 
was examined using a Leica TCS SP2 AOBS, inverted microscope. The Tdrd9 
mutant was described previously"’. 

Global 5'-RACE library. Total RNA from purified round spermatids (Miwi*/~ 
and Miwi “?") was depleted of rRNAs using the Ribo-Zero kit (Epicentre) and 
then ligated to the Illumina 5’RNA adaptor in a 10 ul reaction (10 U RNA ligase, 
1mM ATP, 0.75 1M 5’-adaptor, 20 U RNase inhibitor) at 20 °C overnight. Only 
RNAs carrying a natural 5’ monophosphate should be ligated because a kinase step 
was omitted. RNAs were re-purified using the Absolutely RNA kit (Stratagene). 
This step allows the removal of a majority of short RNAs (<100nt, piRNAs, 
adapters) from the RNA sample. The eluted RNA was phenol-chloroform 
extracted and precipitated. Reverse transcription was performed in a 20 pil reaction 
with 100pmol random primer (5’-ACAAGCAGAAGACGGCATACGA-Ng, 
where N is a random nucleotide), 0.5mM dNTP, 1X RT buffer, 5mM DTT, 
20 U RNase inhibitor and 200 U reverse transcriptase. The reaction mix was first 
incubated at 25°C for 2min. After addition of the enzyme, the reaction was 
incubated for 5 min at 25°C, followed by a step at 50°C for 40 min and 55 °C 
for 20 min. 50% of the RT reaction was used for PCR (98 °C, 30s; 35 cycles: 98 °C, 
10s; 60°C, 30s; 72 °C, 15; final amplification: 72 °C, 5 min). PCR reaction was 
carried out with 1 U of Phusion High-Fidelity DNA Polymerase, 0.2 mM dNTP, 
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1.5mM MgCl, and 250 uM primer. PCR products of ~200 bp were gel-eluted for 
sequencing (105 cycles) by the Illumina Genome Analyser (EMBL Gene core 
facility). 

Small RNA and RNA-seq library preparation. Miwi-associated piRNAs were 
isolated from testes of wild-type mice of different ages: 14, 20 days after birth (P14, 
P20); and from adult mice (>P60). Adult males were used for the generation of 
libraries from the Miwi*/4?" and Miwi “?" mutant. Triplicates from independent 
biological samples were prepared for the catalytic mutant Miwi 4?", One library 
was made from the Miwi*““?" mutant. Libraries were prepared by sequential 
ligation of 5'- and 3’-RNA adapters. Products were gel eluted after each ligation 
step. Sequencing was for 36 cycles using the Illumina Genome Analyser IIx (EMBL 
Gene core facility). 

Round spermatids were purified from Miwi*’~, Miwi 4?" and Miwi*/4?7 
adult animals using a BSA gradient**. We note that in such a purification protocol, 
the control (+/—) will contain the full spectrum of round spermatid stages, 
whereas the mutants will have only those until the respective arrested stages. 
Total RNA was extracted using Trizol and depleted of abundant RNAs like 
rRNAs using Ribo-Zero kit (Epicentre). Strand-specific RNA-seq library was 
prepared using the ScriptSeq mRNA-seq Library Preparation Kit (Epicentre; 
Cat no. SS10906) and sequenced (EMBL Gene core facility, Heidelberg). We 
obtained ~20 million reads for each genotype from 76-cycle Illumina runs. 
Bioinformatics analyses. Reads from piRNA libraries were mapped to the 
genome and only perfect mappers were selected for further analysis. In-house 
bioinformatics tools developed by the RS. laboratory and used in this study are 
described“*. For cluster analysis, reads were mapped on to 2-kb tiles with 
minimum of 50 hits and moved in steps of 1 kb. The top clusters from such an 
analysis are shown for all the Miwi libraries (Supplementary Table 2). 

5'-end overlap analysis. The three wild type (+/—) and the three catalytic 
mutant (—/ADH) piRNA libraries were read-depth normalized and 27-31-nt- 
long perfect and unique mapping reads were used in the analysis. Using a window 
of 300 nt, we calculated genome-wide, the correlation of 5’ ends from opposite 
strands with the pairs function, as previously described’. Afterwards, each 
obtained correlation profile was normalized by dividing through the total correla- 
tion measured within 300 nt and the mean and s.d. for the three +/— and three 
—/ADH combined correlation profiles were determined. The genomic loci giving 
rise to a predominant peak at 10 was investigated further. This is indicative of the 
presence of reads with their 5’ ends overlapping by 10 nt. The genomic annotation 
of those loci was determined and found to be very similar to that seen for Miwi 
piRNAs—with a large component of unannotated reads (data not shown). 

Wealso investigated the 5'-end overlap distances of piRNAs with the 5’ ends of 
reads from the RACE library genome-wide (Fig. 4e and Supplementary Fig. 12). 
Here, we calculated the number of sites carrying a particular 5’ distance, as previ- 
ously described”. We also calculated the overlap over a LINE element (L1Md_F2; 
chr2:69191898-69194424) that is upregulated in the catalytic mutant (Fig. 4e and 
Supplementary Fig. 10c). 

Microarray analysis. Round spermatids were purified from Miwi’’, 
Miwi ‘AP (catalytic mutant), Miwi ‘“~ and Miwi*?* adult mice using a 
BSA gradient method’. Gene expression profiles were analysed by Affymetrix 
gene array, in biological duplicates (EMBL Gene Core, Heidelberg). Microarray 
analysis was carried out in R/Bioconductor™. First, the microarrays were corrected 
for background with RMA and quantile-normalized with the aroma package”*. 
Duplicates were combined and significant (P <0.001) changing genes were iden- 
tified with limma*’”. To remove unexpressed genes from the further analysis, genes 
having log, intensity higher than 5.6 in at least one of the four sets were selected as 
expressed genes. 
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RNA-seq analysis. All mapped reads were normalized to an artificial library size 
of 1 million reads, in order to allow comparisons across libraries. For the read- 
depth normalized RNA-seq libraries, we determined the expression of genes by 
adding up all unique mapping reads falling into the exonic region of a particular 
gene. Genes where considered to be expressed when they carried at least 5 
reads per million in one of the libraries. For these genes the fold changes were 
determined. 


28. Reuter, M. et a/. Loss of the Mili-interacting Tudor domain-containing protein-1 
activates transposons and alters the Mili-associated small RNA profile. Nature 
Struct. Mol. Biol. 16, 639-646 (2009). 

29. Chuma, S. et al. Tdrd1/Mtr-1, a tudor-related gene, is essential for male germ-cell 
differentiation and nuage/germinal granule formation in mice. Proc. Nat! Acad. Sci. 
USA 103, 15894-15899 (2006). 

30. Tanaka, T. et a/. Tudor domain containing 7 (Tdrd7) is essential for dynamic 
ribonucleoprotein (RNP) remodeling of chromatoid bodies during 
spermatogenesis. Proc. Nat! Acad. Sci. USA 108, 10579-10584 (2011). 

31. Brahms, H. et a/. The C-terminal RG dipeptide repeats of the spliceosomal Sm 
proteins D1 and D3 contain symmetrical dimethylarginines, which form a major 
B-cell epitope for anti-Sm autoantibodies. J. Biol. Chem. 275, 17122-17129 
(2000). 

32. Heidaran, M.A. Showman, R. M. & Kistler, W. S. A cytochemical study of the 
transcriptional and translational regulation of nuclear transition protein 1 (TP1),a 
major chromosomal protein of mammalian spermatids. J. Cell Biol. 106, 
1427-1433 (1988). 

33. Stanker, L.H., Wyrobek, A. & Balhorn, R. Monoclonal antibodies to human 
protamines. Hybridoma 6, 293-303 (1987). 

34. Kotaja, N., Lin, H., Parvinen, M. & Sassone-Corsi, P. Interplay of PIWI/Argonaute 
protein MIWI and kinesin KIF17b in chromatoid bodies of male germ cells. J. Cel! 
Sci, 119, 2819-2825 (2006). 

35. Kirino, Y. & Mourelatos, Z. Mouse Piwi-interacting RNAs are 2’-O-methylated at 
their 3’ termini. Nature Struct. Mol. Biol. 14, 347-348 (2007). 

36. Qi, Y. et al. Distinct catalytic and non-catalytic roles of ARGONAUTE4 in RNA- 
directed DNA methylation. Nature 443, 1008-1012 (2006). 

37. Kontgen, F. & Stewart, C. L. Simple screening procedure to detect gene targeting 
events in embryonic stem cells. Methods Enzymol. 225, 878-890 (1993). 

38. Farley, F. W., Soriano, P., Steffen, L. S. & Dymecki, S. M. Widespread recombinase 
expression using FLPeR (flipper) mice. Genesis 28, 106-110 (2000). 

39. Schwenk, F., Baron, U. & Rajewsky, K. A cre-transgenic mouse strain for the 
ubiquitous deletion of loxP-flanked gene segments including deletion in germ 
cells. Nucleic Acids Res. 23, 5080-5081 (1995). 

40. Wen, J. & Brogna, S. Nonsense-mediated mRNA decay. Biochem. Soc. Trans. 36, 

514-516 (2008). 

Al. Shoji, M. et al. The TDRD9-MIWI2 complex is essential for piRNA-mediated 

retrotransposon silencing in the mouse male germline. Dev. Cell 17, 775-787 

(2009). 

42. Karginov, F. V. et al. Diverse endonucleolytic cleavage sites in the mammalian 

transcriptome depend upon microRNAs, Drosha, and additional nucleases. 

Mol. Cell 38, 781-788 (2011). 

43. Pivot-Pajot, C. et al. Acetylation-dependent chromatin reorganization by BRDT, a 

testis-specific bromodomain-containing protein. Mol. Cell. Biol. 23, 5354-5365 

(2003). 

44. Olson, A. J., Brennecke, J., Aravin, A.A., Hannon, G. J.& Sachidanandam, R. Analysis 

of large-scale sequencing of small RNAs. Pac. Symp. Biocomput. 126-136 (2008). 

45. Gentleman, R. C. et al. Bioconductor: open software development for 

computational biology and bioinformatics. Genome Biol. 5, R80 (2004). 

46. Bengtsson, H., Wirapati, P. & Speed, T. P. A single-array preprocessing method for 

estimating full-resolution raw copy numbers from all Affymetrix genotyping arrays 

including GenomeWideSNP 5 & 6. Bioinformatics 25, 2149-2156 (2009). 

47. Smyth, G.K. Linear models and empirical Bayes methods for assessing differential 
expression in microarray experiments. Stat. Appl. Genet. Mol. Biol. 3, article 3 
(2004). 


©2011 Macmillan Publishers Limited. All rights reserved 


| sid ial Be 


doi:10.1038/nature10577 


Atomic-resolution dynamics on the surface of 
amyloid-f protofibrils probed by solution NMR 
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Exchange dynamics between molecules free in solution and bound 
to the surface of a large supramolecular structure, a polymer, a 
membrane or solid support are important in many phenomena 
in biology and materials science. Here we present a novel and 
generally applicable solution NMR technique, known as dark-state 
exchange saturation transfer (DEST), to probe such exchange 
phenomena with atomic resolution. This is illustrated by the 
exchange reaction between amyloid-B (AB) monomers and poly- 
disperse, NMR-invisible (‘dark’) protofibrils, a process of significant 
interest because the accumulation of toxic, aggregated forms of Af, 
from small oligomers to very large assemblies, has been implicated 
in the aetiology of Alzheimer’s disease’. The '"N-DEST experiment 
imprints with single-residue-resolution dynamic information on 
the protofibril-bound species in the form of '°N transverse relaxa- 
tion rates (1°N-R,) and exchange kinetics between monomers and 
protofibrils onto the easily observed two-dimensional ‘H-'°N 
correlation spectrum of the monomer. The exchanging species on 
the protofibril surface comprise an ensemble of sparsely populated 
states where each residue is either tethered to (through other 
residues) or in direct contact with the surface. The first eight 
residues exist predominantly in a mobile tethered state, whereas 
the largely hydrophobic central region and part of the carboxy 
(C)-terminal hydrophobic region are in direct contact with the pro- 
tofibril surface for a significant proportion of the time. The 
C-terminal residues of both AB40 and Af42 display lower affinity 
for the protofibril surface, indicating that they are likely to be 
surface exposed rather than buried as in structures of Af fibrils”""°, 
and might therefore comprise the critical nucleus for fibril forma- 
tion’, The SN—Riethered values, however, are significantly larger 
for the C-terminal residues of AB42 than AB40, which might explain 
the former’s higher propensity for rapid aggregation and fibril 
formation’*™*. 

We recently reported the existence of a dynamic exchange process 
under pseudo-equilibrium conditions between monomeric and very 
high molecular mass, soluble species of AB40 (ref. 15). The latter form 
spontaneously in concentrated (150-300 1M) unstirred aqueous solu- 
tion, and comprise polydisperse, non-fibrillar aggregates typical of 
worm-like protofibrils (Supplementary Fig. 1a). Similar protofibrils are 
formed by AB42 under the same conditions (Supplementary Fig. 1b). 
These protofibrils, which we have characterized extensively by atomic 
force and electron microscopy (Supplementary Fig. 1), analytical ultra- 
centrifugation (Supplementary Fig. 3) and dynamic light scattering 
(Supplementary Fig. 4) are far too large (1.8-85 MDa) to detect directly 
using solution NMR spectroscopy as their resonances are broadened 
beyond detection’’, and their heterogeneous nature makes biophysical 
characterization challenging*’*’. Both the AB40 and AB42 protofibrils 
present in the equilibrated concentrated samples react strongly with the 
OC polyclonal antibody that recognizes so-called ‘fibrillar’ oligomers 
and fibrils* (Supplementary Fig. 6b); the All polyclonal antibody 
recognizes so-called ‘pre-fibrillar’ oligomers’ and reacts with both 


AB40 and AB42 protofibrils, albeit weakly with the former (Sup- 
plementary Fig. 6a). The predominantly unstructured, intrinsically 
disordered, monomeric AB peptides*”’, however, give rise to excellent 
solution NMR spectra’* (Supplementary Fig. 7), and analytical ultra- 
centrifugation indicates unambiguously that the NMR visible species is 
monomeric, and that there are no measurable quantities of low 
molecular mass oligomers present in either the dilute (Supplemen- 
tary Fig. 2) or concentrated (Supplementary Fig. 5) samples. Indeed, 
in the concentrated samples containing monomer and protofibrils, 
there is no evidence by analytical ultracentrifugation of any species 
other than monomer and protofibrils with M, in excess of 1.8 MDa 
(Supplementary Figs 4 and 5). In this work, we probe the protofibril- 
bound species at a residue-by-residue level using '"N-DEST, a novel 
two-dimensional transfer-of-saturation NMR experiment that exploits 
the exchange process between NMR invisible (‘dark state’) and easily 
observed monomeric states. 

The essence of the two-dimensional '"N-DEST experiment is based 
on the dramatic reduction in rates of re-orientation upon binding to 
the surface of the protofibrils resulting in transverse relaxation rates 
(°N-R,) that are orders of magnitude larger than those of the free 
monomer. These large '°N-R, values preclude direct observation 
by solution NMR, but allow for efficient partial saturation of 
®N longitudinal magnetization of the protofibril-bound form by a 
weak radio-frequency field, even at large offsets where the free 
monomer magnetization is completely unaffected. In the DEST 
experiment, '°N longitudinal magnetization of the free monomer is 
prepared (INEPT and refocused INEPT transfer), transferred by 
chemical exchange to the protofibril-bound form (binding), partly 
saturated (weak '°N saturation at a series of offsets) and transferred 
back to the free monomer (dissociation) where the partial saturation is 
recorded as an attenuation of the cross-peaks of the NMR observable 
free monomer relative to that in a reference spectrum obtained without 
saturation (see Methods and Supplementary Fig. 8). 

NMR measurements were made at 10 °C, pH 6.8 using two concen- 
trations of AB: a low concentration sample (50 1M) at which both 
AB40 and AB42 remain nearly entirely monomeric (>90%) for a long 
period of time (>2 months for AB40 and approximately 1 week for 
A42) based on cross-peak intensities in a ‘H—'°N heteronuclear single 
quantum coherence correlation spectrum; and a higher concentration 
sample in which the fraction of monomer decreases rapidly (over 
approximately 2 days for AB42 and 1 week for AB40) after which a 
stable pseudo-equilibrium between monomer and protofibrils is main- 
tained for more than a week’ (Supplementary Fig. 9). The amount of 
protofibrils formed at a given initial concentration of AB is significantly 
higher for AB42 than AB40 (Supplementary Fig. 9). Thus, for 270 uM 
AB40 and 160M Af 42 samples, the concentration of monomer 
decreases to pseudo-steady-state levels of approximately 110 and 
35 UM, respectively. 

For the monomeric AB40 sample, the 'H-'°N cross-peaks in the 
DEST experiment display an extremely sharp saturation profile with 
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no detectable attenuation for any residue until the saturation pulse is 
less than 2 kHz from the '°N resonance frequencies of the monomer 
(Fig. la, circles). This behaviour, replicated for the monomeric AB42 
sample, matches the predicted saturation profile for a '°N spin with 
relaxation parameters measured for the monomeric species (Fig. 1a, 
solid line). In contrast, the 'H-'°N cross-peaks in the 270-uM AB40 
sample show significant partial saturation at offsets far from the monomer 
'°N resonances. Most importantly, the saturation profiles are clearly 
residue specific (Fig. 1b-d, circles). Relatively narrow profiles are 
observed for residues in the amino(N)-terminal region (Fig. 1b), and 
very broad profiles for residues in the central (residues 17-21, 
for example, Fig. 1c) and part of the C-terminal (residues 30-40) 
hydrophobic regions. Intermediate profiles are observed for residues 
in the hydrophilic region spanning residues 25-29 (Fig. 1d) as well as 
for the three C-terminal residues. Qualitatively similar behaviour is 
observed for the 160-uM AB42 sample (Supplementary Fig. 10a-d). 
The observed residue-specific saturation transfer profiles indicate that, 
when bound to the surface of the protofibrils, the average ISN-R, for 
each residue is highly sequence-position dependent. 

The '°N-DEST profiles can be quantitatively interpreted by 
incorporating a model of equilibrium chemical exchange within the 
framework of the McConnell equations**”’. The simplest model describ- 
ing the equilibrium between free monomer (Minonomer) and peptide 
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Figure 1 | '°N DEST and AR, for AB40. Examples of normalized cross-peak 
intensities as a function of frequency offset from the '°N carrier (118.5 p.p.m.): 
a, Glu3 in the 50 1M AB40 sample (radio-frequency field strength of '°N 
saturation pulse = 170 Hz); b, cand d, Glu 3, Leu 17 and Asn 27, respectively, in 
the 270 uM AB40 sample with '°N saturation at radio-frequency field strengths 
of 170 (orange circles) and 350 (blue circles) Hz. The solid line in a is the 
calculated saturation profile for a ‘°N spin with the experimentally determined 
relaxation rates for monomeric Af40. The dashed and solid lines in b-d are the 
best-fits to an exchange model with a single protofibril-bound state (cf. Fig. 2a) 
and to a model incorporating residues tethered and in direct contact with the 
protofibril (cf. Fig. 2b), respectively. The s.d. of the DEST experimental data 
points is approximately equal to the size of the circles. e, Observed (black filled 
circles; error bars, 1 s.d.) compared with calculated AR; values for the first (grey 
circles) and second (blue circles) models. The reduced y? for the simultaneous 
best-fit to the DEST and AR, data are 9.0 for the first model and 1.8 for the 
second. 
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transiently bound to the surface of the protofibril (Moligomer) is a 
pseudo-first-order process characterized by global first-order apparent 
association (k*PP) and dissociation (ky) rate constants (Fig. 2a) and 
residue-specific ISN-Ry (longitudinal) and R, (transverse) values for the 
free monomer and protofibril-bound states. The value of the k°PP can be 
estimated from the difference, AR>, in ISN_R, values between the 
samples with protofibrils present and the monomeric reference sample 
(50 1M)’°. Because transverse magnetization for the protofibril-bound 
states decays extremely rapidly, the maximum observed AR, value, 
AR, is equal to k2PP under pseudo-equilibrium conditions (note that 
the observed AR, values are independent of nucleus and magnetic 
field’*; see also Supplementary section 5 for a simple explanation of 
AR, and DEST experiments and their interpretation). The saturation 
profiles at two radio-frequency field strengths (170 and 350 Hz) and the 
AR; data for all residues (Fig. le for AB40 and Supplementary Fig. 10e 
for AB42) were fitted simultaneously, optimizing the values of k2PP and 
kop and the residue-specific 1SN-R, values for the protofibril-bound 
species (see Methods). The 'SN-R, and R, values for the monomer were 
taken from relaxation data recorded on the low concentration (50 LLM) 
sample. Although the °N-R; values for the protofibril-bound state are 
unknown, these N-R, values have no effect on the fitted parameters 
within their uncertainties over the expected range (2s |= '°N- 
R, =0.01s ') owing to the design of the experiment (see Methods). 
The simple two-state model fails to account simultaneously for all the 
experimental data in the case of both AB40 (Fig. 1b-e) and AB42 
(Supplementary Fig. 10), with systematic deviations for the DEST pro- 
files (Fig. 1b-d, dashed lines) and failure to reproduce AR, as a function 
of sequence (Fig. le, grey open circles). Extension to a three-site 
exchange model that adds an intermediate species either on- or off- 
pathway, introducing additional unknown rate constants connecting 
monomer, intermediate and protofibril-bound states and residue- 
specific ISN-R, values for the intermediate state, also fails to fit the 
experimental data. 

The above results led us to investigate a simple modification of the 
two-state model in which the ensemble of protofibril-bound states is 
divided into two subsets where a given residue is either in direct contact 
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Figure 2 | Kinetic schemes for monomer exchange on the surface of AB 

protofibrils. a, The protofibril-bound peptide (Moiigomer) exists in only a single 
state. b, The protofibril-bound peptide exists as a large ensemble of states such 
that each residue can be either tethered or in direct contact with the surface of 
the oligomer with K;(i) = ky*PP(i)/k,*P?(i). The circle in the diagrammatic 

representation of the states represents a single residue that is either tethered or 
in direct contact and for which three possible chain configurations are shown. 
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with the protofibril surface (Icontact) or tethered (Itetherea) by residues in 
direct contact with the protofibril (Fig. 2b). The key to understanding 
this model lies in the observation that the variations in both AR, and 
'°N-DEST profiles are residue specific. This variation cannot be 
accounted for by postulating the existence of low molecular mass 
oligomers because, in that case, the AR, and DEST effects would be 
uniform throughout the peptide sequence (as each residue would be 
subject to the same events and the same average relaxation behaviour 
when bound to the surface of the oligomers). Thus for each residue, i, 
there is a potentially different partitioning of the large number of 
conformational states into Itetherea(#) and Icontact(i). This partitioning 
is computed for a single residue at a time, the state of the other residues 
may be tethered, bound or inter-converting between the two, and the 
correlation between states at different positions is not considered. 
Critical to the simplicity of the model is the retention of the overall 
two-state association/dissociation process; the global apparent asso- 
ciation rate constant for the conversion of monomer to the protofibril- 
bound ensemble, k2PP, is given by the sum of the pseudo-first-order rate 
constants k{’?(i) and k5??(i) for the conversion of a residue i from the 
monomeric state (Imonomer) tO Ttethered(é) and Icontact(é), respectively; the 
global dissociation rate constant, kos, from protofibril-bound states to 
monomer, is the same for I;etnerea(i) and Teontact(i) for all residues i (that 
is, kop = k_, = k_2). Thus, the two-state equilibrium between monomer 
and protofibril-bound forms is preserved, and k2?P and ko represent 
ensemble averaged rates. This model adds the fewest parameters to 
describe a system with a huge number of potential configurations (2° 
and 2” for AB40 and AB42, respectively, where each residue is repre- 
sented as either tethered or in direct contact) with differing behaviour at 
each residue. Because there are so many combinations of states, the 
addition of further variation from the polydispersity of the protofibrils 
does not impact the interpretation of the data or its quantitative analysis 
in terms of the model in Fig. 2b. 

A residue-specific equilibrium between the tethered and direct- 
contact states is described by the ratio of Teontact(é) to Tsetherea(@)) 
K3(i) = KP? (i)/kyP (i). Direct inter-conversion between Icontact(i) and 
Itethered(#) is not necessary to fit the data and, hence, is not incorporated 
in the analysis presented in the main text (see Supplementary Table 1 
and Supplementary Fig. 11 for analysis including these kinetic paths). 
The !5 N—Riethered values for the Itetherea States are also residue-specific 
and describe the average dynamics of a given residue in the tethered 
state. In physical terms, this will depend upon the average length and 
flexibility of the segment separating residue i from other residues of the 
peptide in direct contact with the oligomer. 

The model in Fig. 2b is a reduced, ‘model-free’ representation of the 
actual system, which clearly comprises a large ensemble of different states 
and inter-conversion networks. Although the model is phenomeno- 
logical in that individual protofibril-bound states in the ensemble and 
the correlated behaviour of residues are not explicitly considered, the 
smooth variation of the fitted parameters as a function of residue is 
indicative of correlated behaviour at adjacent positions, as to be expected 
for a model that well describes physical behaviour. Further, the model 
does not require that a single protofibril state be present or that the 
members of the protofibril ensemble even have the same interfaces with 
bound monomers. The ability to interpret the experimentally observed 
residue-by-residue differences between AB40 and AB42 demonstrates 
the success of the model. 

Least-squares nonlinear optimization of the scheme shown in Fig. 2b 
results in excellent fits to both the '"N-DEST saturation profiles 
(Fig. 1lb-d and Supplementary Fig. 10a—d) and the AR, data (Fig. le 
and Supplementary Fig. 10e). For AB40 at 270 1M, kepP = 3.0 £0.75 — * 
in agreement with the observed value of 3.0 s | for AR} (Fig. le), and 
ko =51412s 1, in good agreement with our previous estimate of 
approximately 73s ' obtained from a one-dimensional 'H saturation 
experiment’>. The values of k*?? and ko for AB42 at 160 uM are 2.48 * 
(in agreement with 2.3 ' for AR™™) and 62s ' respectively, although 
the standard deviations in these fitted parameters are higher owing to 
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lower experimental signal-to-noise (+1.8 s 'and+46s ‘). The values 
of SN—RS™"* are 18,800 + 700s * and 19,300 + 2000s * for AB40 
and Af42, respectively, which are reasonable for the large, worm-like 
protofibrils seen in these samples by atomic force and electron micro- 
scopy (Supplementary Fig. 1). From the values of the association and 
disassociation rate constants one can deduce that the populations of 
transiently bound AB40 and AB42 are approximately 6% and 4%, 
respectively, that of free monomer. 

The residue-specific partitioning, K3(i), between the direct-contact 
and tethered states (Fig. 3a) and the associated 15) Ricthered relaxation 
values (Fig. 3b) provide the first characterization of the protofibril- 
bound forms of AB40 and Af42 at the single-residue level. 
Qualitatively the K; and SN—R'«red profiles for AB40 and AB42 
are similar, with the notable exception of the IS\— Riethered values for 
the C-terminal residues. We also note that the values of K3 and 
1S\y__Ricthered are directly correlated with the observed widths of the 
saturation profiles and AR, respectively (Supplementary Fig. 12), 
demonstrating that the model is consistent with a qualitative analysis 
of the data suggested directly from the observed residue-specific 
behaviour. 

For both AB40 and Af42 the first eight N-terminal residues have 
similar, small values of K; (0.08-0.14), indicating that these residues 
are predominantly tethered, spending only a small fraction (=10%) of 
the time in actual direct contact with the protofibril surface. Correlated 
with this finding, these residues retain considerable flexibility as 
reflected by “N—R'"eed values ranging from 20 to 80s ‘. These 
observations are consistent with '*C linewidths in solid-state NMR 
spectra of AB40 fibrils that are indicative of conformational disorder 
for the N-terminal segment", although our data indicate a small but 
measurable population in direct contact with the protofibril surface. 

The highest values of K3 (>0.24) are seen for residues 16-25 and 
28-37 of AB40 and residues 16-23 and 33-40 of AB42, corresponding 
roughly to the central and C-terminal hydrophobic regions that form 
intermolecular B-sheets at the core of amyloid fibrils”'®. Although the 
ratio of direct-contact to tethered states in the AB40 protofibril-bound 
ensemble is very similar for the two regions (0.34 + 0.06 compared 
with 0.32 + 0.04), the average /N—Rtheed values are very different: 
146 + 33s ' for residues 16-22 falling to 61+ 16s ' for residues 
23-25, and 62+ 13s ' for residues 28-37. This suggests that, in the 
tethered states, residues 28-37 of AB40 are considerably more flexible 
than residues 16-22. In terms of a physical picture this suggests that 
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Figure 3 | Comparison of residue-specific fitted parameters describing the 
ensemble of protofibril-bound states. a, Residue-specific equilibrium 

constant (K;) for the relative partitioning of direct-contact and tethered states 
on the surface of the protofibrils. b, 7 N—Ricthered values for the tethered states. 
The values for AB40 and Af42 are displayed as blue and red circles, respectively. 
The proposed two f-strand regions in fibrils of AB40 and AB42 (refs 9 and 8, 
respectively) are indicated by arrows at the top of the figure. Error bars, 1 s.d. 
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the C-terminal region of AB40 may be far from the protofibril surface 
when other residues (such as the 16-22 region) are bound, whereas the 
converse is true for the 16-22 region. For AB42, the average value of 
K; is slightly higher in the central hydrophobic region (0.35 + 0.06) 
compared with the 33-40 region (0.28 + 0.03), but in contrast to 
A40, the corresponding !°N—R''e4 average values are quite similar 
(133+33s ' and 99+35s ', respectively), suggesting that the 
tethered states of the C-terminal hydrophobic region are on average 
less mobile and closer to the protofibril surface in AB42 than in AB40. 

The most striking differences between protofibril-bound AB40 and 
Af42 are seen at the C terminus. Intermediate K; values (0.21-0.23) 
associated with low N—Riethered values (58, 24 and 23 s | for residues 
38, 39 and 40, respectively) are observed for the last three C-terminal 
residues of AB40 (Fig. 3). Thus, residues 38-40 in the ensemble of 
A40 protofibril-bound states are more mobile and hence significantly 
less often buried than the central (residues 16-25) and C-terminal 
(residues 28-37) hydrophobic regions. This is in sharp contrast to 
Af 40 fibrils, where the C-terminal residues are located at the central 
core of the fibre, forming tight intermolecular contacts with both 
neighbouring strands in the C-terminal cross-f sheet and adjacent 
subunits in the supramolecular structure’. Interestingly, the last two 
C-terminal residues of antibody-stabilized AP40 protofibrils also 
exhibit increased mobility as assessed by solid state NMR”, suggesting 
that enhanced dynamics of the C-terminal residues of AB40 are at least 
partly conserved when the protofibril-bound states are incorporated 
into the protofibril core. Although the population of direct-contact 
states for the C-terminal residues of AB42 is similar to that of AB40 
(Fig. 3a), the corresponding !*N—R'ered values are significantly 
higher for AB42 (76 + 16 s | for residues 39-42 of AB42 compared 
with 23.5+0.5 s | for residues 39-40 of AB40; Fig. 3b). Thus, the 
mobility of the C-terminal residues in the tethered states of AB42 is 
hindered relative to that in AB40. 

If dynamics of the C-terminal hydrophobic region and C-terminal 
residues in the protofibril-bound states, where AB40 and AB42 differ 
(Fig. 3), constitute the apparent free-energy barrier in the nucleation- 
dependent conformational change or polymerization separating dis- 
ordered oligomers from fast-elongating amyloid fibrils, these 
observed differences may explain the higher rate of protofibril and 
fibril formation of AB42 relative to AB40 (refs 13, 14). 

Further, although the population of direct-contact states for the 
hydrophilic region between residues 27 and 29 is reduced for both 
AB40 and Af42, the K; values are higher for AB40 than AB42 
(Fig. 3a). Because this region constitutes a hairpin turn in Af fibrils, 
the lower population of direct contact states for AB42 may facilitate 
hairpin and subsequent f-sheet formation, thereby enhancing the rate 
of incorporation of surface-bound monomers into the protofibril core. 

In summary, '"N-DEST and AR, measurements probe exchange 
processes between monomer and protofibril-bound states of AB on 
times scale of 10-100 ms, and imprint the residue-by-residue footprint 
of the ‘dark’ protofibril-bound state on the easily observed monomer. 
These experiments are complementary to pulsed H/D exchange 
experiments that have been used to probe fibril formation and recyc- 
ling on a million-fold slower timescale**®, as well as to spin-state 
selective methyl group relaxation dispersion experiments” that probe 
invisible state dynamics of supramolecular complexes on the inter- 
mediate chemical shift timescale of 50 Ls to 10 ms (ref. 28). In contrast 
to relaxation dispersion, which requires that the visible and invisible 
states have different chemical shifts, the DEST and AR, experiments 
rely entirely on large differences in transverse relaxation rates (R,). For 
systems where a relatively small particle (that is, a macromolecule 
whose resonances are directly observable by solution NMR) is in 
exchange with a state bound to a large particle (for example, an aggregate, 
a fibril, a large supramolecular complex or a solid support), large 
differences in R, values will be present and consequently the R, values 
in the ‘dark’ state can be determined by DEST. The DEST experiment 
requires only that the exchange rate (k2PP + kog) be larger than the 
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SN-R, of the observed species. The AB monomer has a SN-R, of 
approximately 1.5s ', significantly faster and hence more limiting 
than for a larger protein. Because k*PP is a pseudo-first-order rate 
constant, the use of higher concentrations can increase the apparent 
on-rate, thereby extending the range of applicability. The upper limit 
on the value of k?PP is approximately 100s ', beyond which accurate 
determination of AR, is difficult. We anticipate that the DEST tech- 
nique for examining dynamics at single-residue resolution of otherwise 
invisible ‘dark’ states will be applicable to many areas of current interest 
both in biology (for example, the interaction of unfolded proteins with 
the proteasome” and very large chaperones, the formation of low- 
affinity, large supramolecular complexes, and the folding-upon- 
binding” of intrinsically disordered proteins) and materials science 
(for example, exchange reactions on surfaces). 


METHODS SUMMARY 


Solutions of uniformly '°N-labelled AB40 and AB42 were prepared from NaOH- 
treated stocks as described previously'’. NMR samples, comprising 50 and 270 1M 
A40 and 50 and 160 11M AB42 in 50mM HEPES, pH 6.8, and 90% H20/10% 
D,O, were prepared and maintained at 4-10 °C at all times. All NMR experiments 
were conducted at 10 °C ona Bruker 900 MHz spectrometer equipped with a triple 
resonance z-gradient cryoprobe. Fitting the DEST and AR, data to various kinetic 
schemes using the McConnell equations to determine the oligomer-bound R; 
values and kinetic parameters was performed as described in Methods. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Description of the two-dimensional '"N-DEST experiment and pulse 
sequence. Owing to relatively small equilibrium magnetization of '°N nuclei, 
initial amide hydrogen ('Hy) magnetization for the monomeric species is trans- 
ferred to longitudinal '°N magnetization, after which a weak '°N saturation pulse 
is applied at a series of radio-frequency offsets (ranging from +35 to —35 kHz) for 
a sufficient length of time (0.9 s) to allow the monomeric peptide to sample a 
protofibril-bound state where '°N-R, values are orders of magnitude larger than in 
the monomeric species and allow for efficient partial saturation by the weak radio- 
frequency field. Partial saturation is then transferred back to the monomeric state 
by chemical exchange, and recorded as a decrease in intensity of the 'H-'°N 
correlation cross-peaks for the NMR observable monomeric species relative to 
that in a reference spectrum obtained without saturation (see Supplementary 
Fig. 8). Because initial ‘Hy to '"N INEPT transfer of magnetization in the high 
molecular mass protofibrils is extremely inefficient owing to very fast 'H and '°N 
transverse relaxation, the observed transfer-of-saturation only arises from 
peptides that begin the experiment as monomers, are incorporated into 
protofibrils, and subsequently released within the saturation time. The pulse 
sequence for the two-dimensional heteronuclear single quantum coherence 
correlation-based '"N-DEST experiment is shown in Supplementary Fig. 8. The 
pulse sequence is similar in spirit to that commonly used for measuring '°N-R, 
longitudinal relaxation except that the variable T, relaxation delay is replaced by a 
weak, continuous-wave saturation pulse at variable radio-frequency offsets. The 
DEST experiment begins from the amide proton magnetization H, of the 
monomeric peptide. Following the first INEPT transfer, longitudinal two spin 
order 2H_.N, is obtained (Supplementary Fig. 8, point a). The subsequent refocus- 
ing INEPT provides in phase N, magnetization (Supplementary Fig. 8, point b). 
The 90° 'H pulse ¢5 not only flips the water signal back to the +z-axis, but also 
purges any residual anti-phase magnetization 2N,H, before the 'SN continuous- 
wave saturation pulse. The latter is applied for 0.9 s at a series of 15 different offsets 
(35, 28, 21, 14, 8, 4, 2, 0, -2, —4, -8, —14, —21, —28 and —35 kHz) from the !°N 
carrier frequency (set to 118.5 and 117.6 p.p.m. and located at the centre of the 
monomer spectrum of AB40 and Af42, respectively) and at two radio-frequency 
field strengths (170 and 350 Hz). In addition, two reference spectra were recorded 
at each concentration without saturation. During the continuous-wave !°N pulse, 
the magnetization (N-) originating from the monomeric peptide is partly saturated 
when the peptide samples a protofibril-bound state. The latter cannot be directly 
observed owing to extremely rapid transverse (Rz) relaxation. This partial satura- 
tion results in a reduction of the N, magnetization by a multiplicative factor « 
(relative to that in the reference spectrum with the saturation pulse turned off) 
which depends on the strength and frequency of the '°N saturation pulse as well as 
on the kinetics of the monomer-oligomer exchange process (see main text for 
more details), thereby providing valuable insight into the inter-conversion 
dynamics at atomic resolution. When the oligomer-bound peptide is released 
back into solution as a monomer, its reduced magnetization KN, is detected by 
heteronuclear single quantum correlation. It is important to note that inversion of 
the phases of both the 90° 'H pulse ¢, and the receiver phase alternates the 
orientation of the N, magnetization between the +z and -z-axes. Summation of 
transients with these alternating phases removes non-exponential contributions to 
longitudinal relaxation (for example, owing to non-zero magnetization at 
Boltzman equilibrium) so that loss of magnetization due to R, relaxation at the 
end of the continuous-wave saturation pulse is a simple scaling factor e~*’, and 
hence the R, dependence is effectively removed when cross-peak intensities are 
normalized to those in the spectrum without saturation*'. Longitudinal cross- 
correlated relaxation interference between the '"N-'H dipolar interaction and 
the '°N chemical shift anisotropy is eliminated by application of the 180° 'H pulse 
train (with an inter-pulse delay of 100 ms) during the continuous-wave '°N pulse. 
With a cryogenic probe at a 'H spectrometer frequency of 900MHz, this 
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inter-pulse delay is sufficient to return the water signal back to the + z-axis through 
radiation damping. The de-phasing INEPT following the saturation period results 
in an anti-phase term x2N,H, (Supplementary Fig. 8, point c). The anti-phase '°N 
magnetization is then converted to amide proton anti-phase magnetization. The 
last refocusing INEPT, incorporating a WATERGATE scheme, not only re-phases 
the anti-phase proton magnetization to in-phase «H,, but also suppresses the 
residual water signal before direct detection of the monomeric peptide amide 
proton magnetization. 

Data acquisition and analysis. Solutions of uniformly '°N-labelled AB40 and 
Af42 were prepared from NaOH-treated stocks as described previously’°. NMR 
samples, comprising 50 and 270 1M A40 and 50 and 160 1.M Af42 in 50mM 
HEPES, pH6.8, and 90% H20/10% D2O, were prepared and maintained at 
4-10°C at all times. All NMR experiments were recorded at 10°C using a 
Bruker Avance-III spectrometer operating at 'H_ frequency 900.27 MHz and 
equipped with a Bruker TCI z-axis gradient cryogenic probe. '°N-R, data on 
the 50 and 270uM AB40 samples and 50 and 160M Af42 samples and 
'°N-R, data at 50 [1M were measured using standard interleaved two-dimensional 
"H-}N heteronuclear single quantum coherence correlation-based NMR experi- 
ments and analysed as described previously’”. 

The two-dimensional '°"N-DEST spectra were acquired as 32 interleaved experi- 
ments (with the radio-frequency field strengths and frequencies for the continuous- 
wave '°N pulse given above). For AB40, each two-dimensional experiment 
comprises 154* X 1,900* complex data points in the indirect '°N and direct 'H 
dimensions, respectively, with corresponding acquisition time of 77 and 175.6 ms. 
For AB42, 78* X 1,900* complex points were acquired with acquisition times of 41 
and 175.6 ms. Sixteen transients for AB40 and 32 for AB42 were collected per free 
induction decay with a 1.7-s delay between scans. The total experimental times for 
AB40 and AB42 were each approximately 5 days. 

The complete interleaved two-dimensional data set was first split into 32 indi- 

vidual two-dimensional data sets, one for each experiment at a particular '°N 
saturation pulse power level and frequency, using a C program written in-house. 
Each experiment was then processed identically and analysed using NMRPipe”. 
The time-domain data in each spectrum were apodized in the indirect '"N dimen- 
sion with a 54°-shifted sine bell function. For the direct 1H dimension, an expo- 
nential window function with a line broadening of 5 Hz for AB40 and 8 Hz for 
A 42 was applied, followed in the case of the AB40 data by a 72°-shifted squared 
sine bell. The additional line-broadening for the AB42 data was used to improve 
the signal-to-noise owing to the very low (approximately 35 ,1M) concentration of 
free AB42 monomer. Time-domain data in the °N and 'H dimensions were then 
zero-filled to obtain a final complex 1024* x 4096* data matrix, with digital 
resolutions of 2.0 and 2.6 Hz, respectively. The residue-specific signal reduction 
factor x as a function of frequency offset of the continuous-wave '°N saturation 
pulse was obtained from the ratio of peak heights in the saturation experiment 
versus the reference experiment with the saturation pulse turned off. (The Bruker 
pulse sequence and acquisition parameter files for the '"N-DEST experiment, as 
well as the processing scripts, are available on request.) 
Simultaneous fitting of the DEST and AR, data to kinetic models. A homo- 
genous form of the McConnell equations” describing a single spin in chemical 
exchange, as presented in equation (21) of ref. 23, was expanded to a three-state 
system. Owing to the small gyromagnetic ratio of '°N nuclei and their sparse 
density in proteins, cross-relaxation between neighbouring '‘°N nuclei in both 
monomer and protofibril-bound states is negligible, in marked contrast with the 
one-dimensional 'H transfer-of-saturation profiles reported previously". Effects 
on the saturation owing to cross-relaxation between ‘°N and directly bonded 'H 
nuclei, though predicted to be small, are removed with phase-cycling, placing the 
initial '"N magnetization during the saturation period on the +z and ~z-axes in 
alternating transients and measuring their intensity difference. The time depend- 
ence of the magnetization is given by: 


rE/27 7 0 0 0 0 0 0 0 0 0 0 7 fe/27 
TA 0 W+tkh+kh QA ay —k_, 0 0 = hug 0 0 i 
ib 0 a Mtkhth Ox 0 —k_y 0 0 —k_y 0 i 
i —204 Wy —Ox prtk +k 0 —k_y 0 0 —k_, oa 
ad| B 0 —k 0 0 tka tks Qi Wy ks 0 0 rp 
dt| I} 0 0 —k, 0 —2 WB 4 k_y+ks Ox 0 —k 3 0 14 0) 
i —208 0 0 —k Wy Ox pe+kitks 0 0 —k_3 ze 
Ig 0 -k 0 0 —k; 0 0 AC +k_2+k_3 Qe oy ic 
i 0 0 —ky 0 0 —ks 0 —ae 2 +k_o +k-3 Ox Ty 
Li | |-20° 0 0 —k, 0 0 —k; Wy Oy pot+k_.+k_3] | If J 


where I represents the magnetizations of the '°N spins in the rotating frame in the 
free monomer, and in the tethered and direct-contact protofibril-bound states 
(denoted by the superscripts A, B and C, respectively). The relaxation rates for 


transverse and longitudinal magnetization are 4 and p, respectively; Q is the 
resonance offset frequency calculated as the difference between the peak res- 
onance frequency (accounting for peak folding in the case of AB42 Ala 42) and 
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the '°N carrier frequency + continuous-wave pulse offset, and w the radio- 
frequency field strength of the continuous-wave saturation pulse applied along 
the appropriate axis; E is unity; and @ is related to the equilibrium magnetization 
as described by Helgstrand et al.””. The rate constants are as follows (see Fig. 2b): 
k, and k, are the apparent first-order rate constants for the forward reaction from 
the monomer to the tethered and direct-contact protofibril-bound states, respect- 
ively; kK_, and k_, are the corresponding dissociation rate constants and in this 
instance k_, = k_ (see main text); k, and k_; (not shown in Fig. 2b) are the rate 
constants for the conversion of tethered to direct-contact states, and vice versa, 
respectively. Detailed balance is ensured from the relationship k_3 = ksk,/ky 
(with k_, = k_2). The numerical solution for I“, the experimentally observed 
magnetization in the DEST experiment, after a given saturation time as a function 
of frequency offset was calculated using the matrix exponential function in the 
program Matlab. The solution is calculated for both initial '"N magnetization 
during the saturation period on the +z and -z-axes, I, and I_,, and the differ- 
ence, I, — I_,, is then computed at each frequency offset and radio-frequency 
field strength, normalized to the solution without saturation, and compared to the 
normalized experimentally observed intensity. 

AR; values, for a given set of kinetic and relaxation rate constants, were calcu- 
lated by subtracting the monomer 'SN-R, values for each residue, measured 
experimentally on the monomeric 50 UM samples, from the calculated R2 values. 


The calculated R; value for a system in equilibrium exchange is computed assum- 
ing a single exponential decay: 


wt =— (8) rn 


where the transverse magnetization, i ar is computed at time t = 0.2s and time 0. 
The choice of 0.2s for t was to match to one of the longer delays used in the 
experimental '"N-R, measurements. Simulated delays longer than approximately 
20 ms result in the same value of R$", validating the assumption of single expo- 
nential decay. 

1SN relaxation rates in the protofibril-bound states, and kinetic rate constants 
connecting monomer and protofibril-bound states in all schemes (cf. Fig. 2), were 
fitted by simultaneous nonlinear least squares minimization of the differences 
between the observed and calculated (equation (1)) DEST profiles (at both applied 
radio-frequency field strengths for the continuous-wave '°N saturation pulse) and 
between the observed and calculated (equation (2)) AR, values. 
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Aspartate 112 is the selectivity filter of the human 
voltage-gated proton channel 


Boris Musset'*, Susan M. E. Smith?*, Sindhu Rajan*, Deri Morgan!, Vladimir V. Cherny’ & Thomas E. DeCoursey! 


The ion selectivity of pumps and channels is central to their ability 
to perform a multitude of functions. Here we investigate the mech- 
anism of the extraordinary selectivity of the human voltage-gated 
proton channel’, Hy1 (also known as HVCN1). This selectivity is 
essential to its ability to regulate reactive oxygen species production 
by leukocytes”*, histamine secretion by basophils’, sperm capacita- 
tion®, and airway pH’. The most selective ion channel known, Hy1 
shows no detectable permeability to other ions’. Opposing classes of 
selectivity mechanisms postulate that (1) a titratable amino acid 
residue in the permeation pathway imparts proton selectivity’*", 
or (2) water molecules ‘frozen’ in a narrow pore conduct protons 
while excluding other ions’*. Here we identify aspartate 112 as a 
crucial component of the selectivity filter of Hy1. When a neutral 
amino acid replaced Asp 112, the mutant channel lost proton spe- 
cificity and became anion-selective or did not conduct. Only the 
glutamate mutant remained proton-specific. Mutation of the 
nearby Asp 185 did not impair proton selectivity, indicating that 
Asp 112 has a unique role. Although histidine shuttles protons in 
other proteins, when histidine or lysine replaced Asp 112, the 
mutant channel was still anion-permeable. Evidently, the proton 
specificity of Hy1 requires an acidic group at the selectivity filter. 

Voltage-gated proton channels are considered specific (perfectly 
selective) for protons, because no evidence exists for permeation of 
anything but H™. Specificity, combined with a large deuterium isotope 
effect’ and extraordinarily strong temperature dependence of conduc- 
tion’ suggests a permeation pathway more complex than a simple 
water wire, as exists in gramicidin’ . All proton conduction seems con- 
sistent with a hydrogen-bonded chain (HBC) mechanism"; a HBC 
including a titratable group could explain several unique properties of 
Hy] (ref. 1), especially proton selectivity’*. Yet in a recent study, muta- 
tion of each titratable amino acid in all four transmembrane helices of 
Hy]1 failed to abolish conduction’*. Thus, the mechanism producing 
proton selectivity remained unknown. 

We noticed that a human gene, C150rf27 (of unknown function), 
contains a predicted voltage sensor domain (VSD) that shares 25% 
sequence identity and 52% similarity (http://www.ebi.ac.uk/Tools/ 
emboss/align/needle) with the VSD of Hyl, and includes three Arg 
residues in the $4 transmembrane helix that are conserved among all 
known Hy1 homologues. Phylogenetic analysis of VSD sequences (Sup- 
plementary Fig. 1) reveals that a group comprising Hy1, Cl5orf27 and 
voltage-sensitive phosphatase (VSP) sequences separated early from 
the two phylogenetically distinct groups of depolarization activated 
VSDs described previously (Ky channels and Nay/Cay channels), 
supporting the modular evolution of VSD-containing proteins’’. 
Furthermore, Hy1 VSDs occupy a discrete lineage, distinct from those 
of VSP and Cl5orf27 orthologues. 

When we cloned the C15orf27 gene and expressed the product in 
HEK-293 or COS-7 cells, the green fluorescent protein (GFP)-tagged 
protein localized at the plasma membrane (Supplementary Fig. 2), but 
we detected no currents beyond those in non-transfected cells. We 


reasoned that substitutions based on sequence elements that differ 
between Hyl and Cl5orf27 should be structurally tolerated while 
revealing residues responsible for proton conduction. We therefore 
mutated residues that are perfectly conserved in 21 Hy1 family members 
and differ between Cl5orf27 and Hyl. We replaced five candidate 
residues in Hy1 (D112, D185, N214, G215 and $219) (Fig. la, b) with 
the corresponding residue in C15orf27. Four mutants exhibited large 
currents under whole-cell voltage clamp (Fig. 1d). The reversal (zero 
current) potential, V,.,, measured at several pH, and pH; (external and 
internal pH, respectively), was close to the Nernst potential for protons, 
Ey (Fig. 1c), demonstrating proton selectivity. D112V mutants localized 
to the plasma membrane (Supplementary Fig. 3), but showed no con- 
vincing current (Fig. 1d). Some D112V-transfected HEK-293 or COS-7 
cells (and non-transfected cells) had small native proton currents. 
H140A/H193A double mutants'™!”, in which the two Zn’* -binding 
His residues are neutralized, resemble wild type, with similar ApH- 
dependent gating’, and V,., near Ey (Supplementary Fig. 4). We 
expressed mutants in this Zn**-insensitive background (D112X/A/ 
A) to distinguish their currents from native currents that are abolished 
by 100 1M Zn** at pH, 7.0. We tentatively concluded that Asp 112 is 
crucial to proton conduction. 

The absence of detectable currents in D112V led us to make other 
D112X substitutions. These mutants (Fig. 2a) showed slowly activating 
outward currents upon depolarization that resembled Hy1 currents. 
As reported previously’*, Asp 112 mutation had little effect on the ApH 
dependence of gating. The proton conductance-voltage (gy-V) rela- 
tionship of all D112X mutants shifted roughly -60mV when pH, 
increased from 5.5 to 7.0 (Supplementary Fig. 5), as in wild-type 
channels**'*. Mutation of Asp112 did influence channel opening 
and closing kinetics (Supplementary Table 2). 

Measurements of V,., in Asp 112 mutants showed a marked departure 
from wild-type Hy1 properties. At symmetrical pH 5.5, Vie, was near 
0 mV (not shown). At pH, 7.0, pH; 5.5 (Fig. 2a, column 3), wild-type 
channels reversed near Ey; (—87 mV), indicating proton selectivity. 
But for all mutants except D112E, V,., was substantially positive to 
Ey (Fig. 2b), ranging from —58mV (D112H) to —13 mV (D112N). 
Substitutions at Asp 112 eliminated the proton specificity that distin- 
guishes Hy1 from all other ion channels’. A previous study described 
currents in D112A and D112N mutants”, but did not report V,ey. 

We expected that loss of proton selectivity would result in nonselec- 
tive permeation of cations. Surprisingly, V,., did not change detectably 
when Na‘, K*, N-methyl-p-glucamine* or TEA™ (tetraethylammo- 
nium”) replaced TMA’ (tetramethylammonium* ) (Supplementary 
Table 4). To test anion against cation selectivity, we adopted the classical 
tactic of replacing a fraction of the bath solution with isotonic sucrose”. 
The Nernst equation predicts that dilution of all extracellular ions 
except H® and OH" (leaving internal ion concentrations unchanged) 
will shift V..y negatively for a cation-selective channel, but positively for 
an anion-selective one. Despite the tenfold reduction of buffer concen- 
tration, direct measurement confirmed that pH remained constant. 
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Figure 1 | Identification of five key amino acids that differ in Hy1 and 
C15orf27, and the currents generated in a heterologous expression system 
by Hy1 mutants in which Hy1 residues were replaced by the corresponding 
amino acid in the non-conducting C15orf27. a, Representative subset of 
multiple sequence alignment of 122 VSDs, only transmembrane helices are 
shown. Gene families include Hy1, voltage-sensitive phosphatases, C15orf27, 
Ca** and Na* channels, and K* channels (see Supplementary Fig. 1). 

b, Location of the key amino acids in the open Hy1 channel VSD viewed from 
the side (membrane), based on a homology model”. ¢, Vyey in the four 


Figure 3 illustrates determination of V,., from tail currents in a 
D112H-transfected cell at pH 5.5//5.5 (pH.//pH;) in methanesulpho- 
nate (CH3SO3 ; Fig. 3a) or Cl solution (Fig. 3c), and after 90% 
reduction of external ionic strength (Figs 3b, d). Surprisingly, for 
all Asp 112 mutants, sucrose shifted V,., positively (Supplementary 
Fig. 6), indicating anion selectivity both in CH3SO3 (Fig. 3e) and Cl” 
solutions (Fig. 3f). For Hy1 and D112E, V,., did not change, reaffirm- 
ing their proton specificity. Neutralization of a single Asp residue 
converts a proton channel into a predominantly anion selective 
channel. Thus, Asp 112 mediates charge selectivity as well as proton 
selectivity. 

To confirm anion permeability of Asp 112 mutants, we replaced 
the main external anion, CH3SO3 , with Cl. Consistent with previous 
studies', V,., in Hy1 was unchanged. As shown in D112H (Fig. 3a, c), 
Viev shifted negatively in CI solutions in all mutants (except D112E), 
indicating that Cl is more permeant than the larger CH3SO3 anion 
(Fig. 3g). That all conducting non-acidic mutants showed Cl per- 
meability indicates that Asp 112 mediates not only proton selectivity, 
but also charge selectivity. Currents were smaller than wild type in cells 
expressing some mutants (Supplementary Fig. 7), suggesting a smaller 
unitary conductance. Evidently, these channels conduct anions, but 
not very well. 

Although the mutant channels have diminished selectivity, V,.. did 
shift negatively when pH, increased from 5.5 to 7.0 (Fig. 2b). Because 
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conducting mutants is near Ey; (dashed line), indicating proton selectivity. Viey 
was measured using tail currents; in G215A Vey was positive to threshold and 
was observed directly. d, Voltage-clamp current families in cells expressing Hy1 
mutants. Depolarizing pulses were applied in 10-mV increments from a 
holding voltage, Vioig = —40 mV (D185M, D112V), —60 mV (G215A, $219P), 
or —90 mV (N214D), with the most positive pulse labelled. After membrane 
repolarization, an inward ‘tail current’ is seen as channels close (see inset for 
D185M); pH is given as pH,//pH;. D112V showed no clear current. 


these solutions differ mainly in buffer species and concentrations of 
H* and OH’, Asp 112 mutants must have significant permeability to 
H* and/or OH”. The Goldman-Hodgkin-Katz equation shows how 
Vey depends on ion concentrations: 


Vrev = 


RT ' Pa- [Cl]; ++ Pcu,soz [CH3SO, ]; + Por- [OH ];+Py+ [H*], 
ZF Spa [Cl], + Pcu,so; [CH3SO; ], + Pou- [OH], + Py [H*], , 


where R is the gas constant, T the absolute temperature (Kelvin), z the 
ionic valence (= 1), F is Faraday’s constant and P is permeability. 

Ions with greater permeability dominate V,.,. Permeation of Ht 
and OH are difficult to distinguish because they have the same Nernst 
potential’. The data can be interpreted assuming permeation of either 
(Supplementary Table 3), but the anion selectivity of Asp 112 mutants 
and the pH dependence of sucrose effects (Figs 3e, f) support OH 
permeation. The relative permeability of conducting Asp 112 mutants 
was OH’ (or H*) > Cl > CH;SO;. 

Although Asp 112 is essential to selectivity, other acidic groups 
might participate. We mutated Asp 185, located in the presumed con- 
duction pore (Fig. 1b)'*’’. However, like D185M (Fig. 1b), D185V, 
D185A and D185N remained proton-selective (Supplementary Fig. 8). 
As evidence against additive effects, the double mutant D112N/ 
D185M did not differ from D112N (Supplementary Fig. 8). 
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Figure 2 | Currents in Asp 112 mutants resemble proton currents, but are 
pH. 5.5 pH. 7.0 pH. 7.0 not. a, Currents generated by wild type (WT), D112E, D112H, D112K/A/A, 
2 7 . D112N/A/A, D112S, D112A/A/A and D112F/A/A in COS-7 cells (pH; 5.5) at 


War yo 30 ~30 -70 pH, 5.5 (column 1) or 7.0 (column 2), during families of pulses in 10 mV 
_ increments up to indicated voltages. Tail currents at pH, 7.0 (column 3) reveal 


mV) 


Change in V,.., ( 


that V,., deviates from Ey, indicating loss of proton selectivity. At pH, 5.5 Viola 
—150 pA —150 pA —|1pA was —40 mV (—60 mV for WT). At pH, 7.0 Viola was —40 mV (K, N, $), 
—50mvV (F), —60 mV (H, A), —80 mV (E), or —90mV (WT); Vpre was 
0 —65mV (WT), —40 mV (E), —10 mV (H), +50 mV (K), +40 mV (N, S), or 
+20 mV (A, F). V,.y (arrows) was determined from the amplitude and 
direction of tail current decay. For D112N, Viey was above Vihreshola and was 
evident during pulse families. b, Shift in Ve, when the TMACH3SO3 bath 
—150 pA solution was changed from pH.5.5 to 7.0. There is no difference between WT 
0.1s and D112E, but the shift in all other mutants is smaller than WT (P < 0.001, by 


60 40 -50 one-way ANOVA followed by Tukey’s test, n = 7, 4, 9, 8, 6, 7, 9 and 4). Error 
Ze —— Mele bars in b are s.e. Dashed line shows Ej. 
100 90 : : ; _— ; r P 
Consistent with earlier predictions that a titratable amino acid pro- 
vides the selectivity filter of Hy1 (refs 1, 8-11), only channels with 
“<0 acidic residues (Glu or Asp) at position 112 manifested proton spe- 
— 
0 


cificity. Asp 112 lies at the constriction of the presumed pore (Fig. 1b), 
a logical location for a selectivity filter, and just external to the postu- 
lated gating charge transfer centre”. Our original prediction envi- 


. = as sioned selectivity arising from protonation/deprotonation of a 
_ pas residue during conduction, but other mechanisms are possible. For 
example, proton selectivity of the influenza A M, viral proton channel 
1 20°pA _15 pA 12.5 pA has been explained by (1) immobilized water’', (2) successive proton 
2s 2s 1s transfer and release by His 37 (refs 22, 23) and (3) delocalization of the 

tie ” 20 proton among His 37 and nearby water molecules”. 
Zz The Cl permeability of D112H was completely unexpected, given 
el strong precedents for His imparting proton selectivity to channels. 
— 20 pA 20 pa ___|20 pa Histidine shuttles protons in K* or Na* channel VSDs with Arg—His 
23 es ca mutations”, in carbonic anhydrase” and in M, channels”””*. However, 
70 20 10 these molecules are not proton-specific’’”’. Evidently, His shuttles 
ae == protons, but does not guarantee proton selectivity. In Hyl, Asp 112 
= ms (or Glu112 in D112E) excludes anions, resulting in proton-specific 
+ 25 pA __110 pA 7 — 10 pA conduction. When protonated, Glu and Asp are neutral whereas His is 

s 2s s 


cationic, which may explain why D112H fails to exclude anions. 
The anion selectivity of neutral Asp 112 mutants indicates that elec- 


40 
o trostatic forces due to the charge distribution in the rest of the channel 
ae deter cation permeation, and that the cation selectivity of the wild-type 

— channel is due to the anionic charge of Asp 112. Asp 185 does not 


__|10 pa __}10pA_i5pa participate directly in selectivity (Supplementary Fig. 8). VSP family 
10s 1s 0.58 members possess the equivalent of Asp 112 (Fig. 1a), yet conduct no 
current’’, illustrating that Asp 112 requires a specific microenviron- 
D112K D112N D112A D112F ment to achieve selectivity. Although permeation of Cl and CH;SO3 

H140AH140A H140A H140A ‘ . : Pair 
WT D112E D112H ee oan D112S ican tt93a suggests a wide pore in D112X mutants, local geometry might differ in 


0 wild-type channels due to the presence of anionic Asp 112. 

Regulation of voltage gating by ApH is distinct from permeation. 
Pathognomonic of Hy] is a strict correlation between the gy-V rela- 
tionship and V,.,, in which Vipreshola Shifts 40 mV per unit change in 
ApH (ref. 8). The ApH dependence persisted in mutants with shifted 
gu-V relationships’. Here we show uncoupling of V,., and voltage 
gating. Asp112 mutants retained normal ApH dependence (Sup- 
plementary Fig. 5), despite the dissociation of Vy from ApH (Fig. 2 
and Supplementary Fig. 9). This uncoupling of pH control of gating 
from permeation speaks against any mechanism that invokes regu- 
lation by local proton concentration in the vicinity of $4 Arg residues”’. 

In summary, Asp 112 is a critical component of the selectivity filter 
of Hy1, crucial to both proton selectivity and charge selectivity. That 
D112E was proton-selective, but D112H conducted anions indicates 
that this proton channel requires an acid at the selectivity filter. That 
neutralization of nearby Asp 185 did not affect selectivity suggests that 
Asp 112 has a unique role. 
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Figure 3 | Dilution of ionic strength by 90% with isotonic sucrose shifted 
Vey positively, indicating that most Asp 112 mutants are anion-selective. 
a, Measurement of V;ey by tail currents in a cell transfected with D112H at 
pH5.5//5.5, and b, after sucrose. c, V,., in the same cell in pH5.5 CI solution, 
and d, after sucrose. Arrows indicate zero current. Viola = —40 mV, 

Vpre = +60 mV. e, Mean shifts of V,-, with decreasing ionic strength in 
CH3SQO3 solutions or f, in Cl solutions. Each value was determined in 3-6 
cells. X = not done. g, Shifts of V,.~ when CH3SO3 was replaced by Cl. Values 
for WT and D112E do not differ significantly from 0 mV. For all anion-selective 
mutants except D112N, the difference between shifts at pH 5.5 and 7.0 was 
significant (P < 0.001, one-way ANOVA followed by Tukey’s test; n = 3-8). 
Error bars in e-g are s.e. 


METHODS SUMMARY 


The pipette solution (also used externally) contained (in mM) 130 TMACH;SOs, 2 
MgCl, 2 EGTA, 80 MES (2-(N-morpholino)ethanesulphonic acid), titrated to 
pH5.5 with ~20 TMAOH. In the pH5.5 TMACI solution, TMACI replaced 
TMACH;SO;3. Bath solutions at pH 7.0 had (mM) 90 TMACH;SO; or TMACI, 
3 CaCl, 1 EGTA, 100 BES and 36-40 TMAOH. For experiments with Zn’*, 
solutions contained PIPES without EGTA. Experiments were done at 20-25 °C. 
Currents are shown without leak correction. V,.y data were corrected for liquid 
junctions potentials measured in each solution”. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Exhaustive searches to identify Hyl homologues were performed using protein 
BLAST and PSI-BLAST. A sample of VSDs from K*, Na* and Ca?" channels 
(that open with depolarization like Hy1, and in addition one that opens with 
hyperpolarization), along with putative Hy1, VSP and C15orf27 homologues were 
chosen. For cation channels, we sampled from the range of subfamilies, from the 
VSD repeats within Na* and Ca** channels, and from the range of species. VSD 
sequences, including crystallized K* channels (PDB accessions 1ORS, 2R9R and 
2A79), were aligned using PromalS3D*’, which incorporates structural informa- 
tion, allowing high-confidence identification of VSD boundaries. Sequences were 
trimmed to the VSD, realigned with PromalS3D, and the resulting alignment was 
analysed with PhyML (maximum likelihood)” and Protpars (maximum par- 
simony)** at the Mobyle portal**. Trees were visualized with TreeDyn*’ and 
iTOL”. Parsimony (not shown) and maximum likelihood trees had similar topo- 
logy, including Hy1 and Cl5orf27 families separating into discrete branches. A 
homology model of the VSD of Hy1 was constructed as described previously’’. 
The C15orf27 clone was PCR-amplified from human cerebellum and subcloned 
into pcDNA3.1(+) expression vector (Invitrogen). The coding sequence of 
human Hyl (HVCN1) was cloned into either pcDNA3.1(—) or pQBI25-fC3 (to 
make GFP-Hy1) vectors as described previously'®. Site-directed mutants were 
created using the Stratagene QuikChange (Agilent) procedure according to the 
manufacturer’s instructions. All the positive clones were sequenced to confirm the 
presence of the introduced mutation. HEK-293 or, more often COS-7 cells were 
grown to ~80% confluency in 35-mm cultures dishes, usually by seeding cells 
1 day ahead of transfection. Cells were transfected with 0.4-0.5 1g of the appro- 
priate cDNA using Lipofectamine 2000 (Invitrogen). After 6 h at 37 °C in 5% COs, 


the cells were trypsinized and replated onto glass cover slips at low density for 
patch clamp recording the following day. We selected green cells under fluor- 
escence for recording. Patch-clamp methods were described previously’®. 

The main pipette solution (also used externally) contained (in mM) 130 
TMACH;SO3, 2 MgCl, 2 EGTA, 80 MES, titrated to pH5.5 with ~20 
TMAOH. In the pH5.5 TMACI solution, TMACI replaced TMACH;SO3. Bath 
solutions at pH 7.0 had (mM) 90 TMACH;SO3 or TMACI, 3 CaCh, 1 EGTA, 100 
BES, and 36-40 TMAOH. For experiments with Zn**, solutions contained PIPES 
buffer” without EGTA. Experiments were done at 20-25 °C. Currents are shown 
without leak correction. V,.y data were corrected for liquid junctions potentials 
measured in each solution”. 
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CORRECTIONS & AMENDMENTS 


CORRIGENDUM 
doi:10.1038/naturel10675 


A draft genome of Yersinia pestis 
from victims of the Black Death 


Kirsten I. Bos, Verena J. Schuenemann, G. Brian Golding, 
Hernan A. Burbano, Nicholas Waglechner, Brian K. Coombes, 
Joseph B. McPhee, Sharon N.DeWitte, Matthias Meyer, 

Sarah Schmedes, JamesWood, David J. D. Earn, 

D. Ann Herring, Peter Bauer, Hendrik N. Poinar 

& Johannes Krause 


Nature 478, 506-510 (2011) 


In this Letter, the GenBank accession number was wrongly printed 
as SRA045745.1. It should be SRA045745. Supplementary Tables 3a 
and 4 have also been corrected. 
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The German government and private sector are pouring money into energy research and infrastructure, such as this solar-energy plant in North Frisia. 


Renewables revolution 


Europe’s largest economy is boosting research on alternative energy sources and 


generating job opportunities. 


BY QUIRIN SCHIERMEIER AND 
KATRIN KOHNERT 


arcus Bar is a prime target for head- 
Mites: An electrical engineer, he 
leads an independent young-inves- 
tigator group at the Helmholtz Centre Berlin 
for Materials and Energy (HZB), working on 
next-generation, high-performance thin-film 
solar cells. But his research experience has 
also attracted the interest of private-sector 
solar-energy companies throughout Germany, 
which are aggressively recruiting skilled sci- 
entists and engineers. Bar has received some 
tempting job offers, he says — but has opted 
to stay in his tenure-track academic position. 
Solar research at the HZB is one of many 
alternative-energy initiatives to get under way 
in Germany in recent years. Energy research 


has long been a priority for the German 
government, but the meltdown in March at the 
Fukushima Daiichi nuclear plant in Japan gave 
it extra urgency. Before the disaster, nuclear 
energy had supplied some 23% of Germany's 
electricity demand. But immediately after the 
meltdown, the German government withdrew 
the licences of seven older nuclear reactors (see 
Nature 472, 505; 2011), and later decided it 
would shut down the nine remaining reactors 
by the end of 2022. This has added impetus to 
plans for investment in energy research over 
the next several years. 

In August, just weeks after announcing the 
nuclear phase-out, the government approved a 
€3.5-billion (US$4.7-billion) energy-research 
programme for 2011-14 — a 75% increase 
over the 2006-09 funding period. Of that, 
some €2.5 billion is earmarked for research on 


renewable energy and energy efficiency. 

This means plenty of new jobs, for researchers 
at private companies as well as those at institutes 
that rely on government funding. Analysts with 
the German Institute of Economic Research 
in Berlin estimate that over the next decade, 
several hundred thousand skilled jobs could 
be generated in the renewable-energy sector, 
many for scientists and engineers. Companies 
and institutes will be recruiting from home and 
abroad; indeed, bringing in more foreign talent 
is a government policy. 


VARIETY AND SCOPE 

The job market spans a multitude of topics in 
basic and applied research, says Bernd Emonts, 
deputy head of the Institute of Energy and Cli- 
mate Research at the Jiilich Research Centre 
(FZJ), who oversees more than 100 people > 
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> working on fuel cells. Future work will focus 
not only on how to gain energy from renewable 
sources, but also on how to store and distribute 
electricity. The government's energy investment 
will allow the FZ] to expand nascent research 
on the potential of batteries and hydrogen for 
efficient energy storage. As energy research 
becomes a national 
priority, the centre 
will seek scientists 
and engineers from 
disciplines including 
material sciences, 
physical chemistry, 
chemical and mate- 
rial engineering and 
mathematics. 

“The market for 
new technologies, 
and the demand for 


“For me, the 


young researchers change to 

who can develop it, industry has 
will boom for many beenthe right 
years to come,’ says Careermove at 


the right time.” 
Aleksandra SaSa 
Bukvié-Schafer 


Joachim Fuchs, pro- 
gramme manager for 
renewable energies at 
the Karlsruhe Insti- 
tute of Technology (KIT), which will benefit 
from both the public funding and industry 
investment. In October, the institute opened 
the Battery and Electrochemistry Laboratory 
(BELLA) in partnership with chemical giant 
BASE, which is based in Ludwigshafen. Over 
the next 5 years, the KIT and BASF each plan 
to invest €6 million in energy-storage and elec- 
tric-car research at the centre, which is about to 
advertise for 4-6 postdoctoral scientists. 

Ideal candidates, says Jiirgen Janek, a physical 
chemist and co-director of BELLA, would 
have a background in electrochemistry and 
skills in solid-state chemistry, nanotechnology 
or material sciences. But those with a general 
background in, for example, physics or inor- 
ganic chemistry could also find opportunities: 
employers can often train such candidates rela- 
tively quickly on the job, says Janek. 


INDUSTRIAL OPPORTUNITIES 

Renewable-energy technology in Germany 
was once the domain of small and medium- 
sized companies. But in September, one of the 
country’s largest and most research-intensive 
companies — Siemens, headquartered in 
Munich — announced plans to withdraw 
from nuclear power and focus on the boom- 
ing renewables sector. Michael Weinhold, 
chief technology officer of Siemens Energy in 
Erlangen, says that the company will focus on 
everything from developing small technical 
components to installing large facilities such as 
offshore wind farms and solar thermal power 
stations, and ultra-high-voltage technologies 
for optimized grids and improved energy 
transmission. Siemens Energy is constantly 
looking for well-trained young scientists and 


engineers, ideally with expertise in renewable 
energies, but also with more general bachelor’s 
or master’s degrees in electrical, chemical or 
mechanical engineering. Weinhold declined 
to provide exact hiring numbers. 

Janek says that with so many groups and 
companies currently recruiting, “the market 
for highly skilled specialists is pretty much 
swept clean”. Rapid growth in the field can 
mean a high turnover of personnel. The Fraun- 
hofer Institute for Wind Energy and Systems 
Technology in Kassel and Bremerhaven, for 
example, hired around 50 specialists this year, 
but it also lost 15 scientists to industry. Some 
academics have recognized the opportuni- 
ties. Aleksandra Sasa Bukvic-Schafer last year 
moved from the Fraunhofer Institute to SMA 
Solar Technology in Niestetal, a leading pro- 
vider of photovoltaic technologies. “For me, 
the change to industry has been the right career 
move at the right time,” she says. “I’m still 
involved in research on electrochemical energy 
storage, but my work here is more focused, less 
project-oriented — and better paid.” Indeed, 
in response to increasing demand, salaries for 
industry-employed engineers have risen by 
more than 4% this year. Starting salaries range 
from an average of €42,000 for engineers with 
bachelor degrees to €52,000 for PhD holders. 


GAP IN THE JOB MARKET 
German technology companies, however, are 
facing a severe shortage of skilled engineers. 
The Association for German Engineers (VDI) 
in Diisseldorf reported that September saw a 
record number of private-sector vacancies 
— almost 100,000, 
many of them in 
renewable energy 
and electro-mobility. 
Bosch Solar Energy 
in Erfurt — a divi- 
sion of Bosch, one 
of the largest tech- 
nology companies 


in Germany — is 

injecting €530 mil- 7 
lion into a centre | : 

for photovoltaics in Electrical and 
Arnstadt. The centre mechanical 
willhave upto 1,000 elgineers are 
skilled technicaland particularly 
engineering jobs by sought after.” 
2012. “Electrical and — Lars Funk 


mechanical engineers 

are particularly sought after,” says Lars Funk, 
head of the VDI’s profession and society divi- 
sion. “That doesn’t mean that companies are 
cutting back on requirements with regards to 
applicants’ skills. But employment opportuni- 
ties are considerably improving” 

Next year, to make foreign hires more 
attractive to companies, the government is to 
lower the minimum annual salary that com- 
panies must pay specialists from outside the 
European Union (EU) before they can get a 
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permanent work permit in Germany from 
€66,000 to €40,000. 

Until now, foreign recruitment has been less 
successful than the government hoped. Com- 
panies are often unsure about the value of for- 
eign degrees and qualifications, especially with 
regard to applicants from outside the EU. And 
employers must deal with work permits and 
pay relatively high salaries for skilled non-EU 


researchers. Academic jobs, which usually offer 
fixed-term appointments, are generally a better . 


prospect for foreign scientists. Residence per- 
mits are easier to get — scientists can apply for 
a researcher's visa — and there is no minimum 
salary required to obtain a work permit. 


DEGREE OF DIFFERENCE 

To help to meet the demand, German research 
and applied-science universities offer more 
than 100 postgraduate renewable-energy pro- 
grammes — covering aspects from basic sci- 
ence to energy economics and environmental 
management. Some programmes focus on 
specific technologies; for example, the Tech- 
nical University of Berlin runs a master’s pro- 
gramme in engineering for solar technology. 
Others, such as the University of Oldenburg’s 
master of science in renewable energy, provide 
basic knowledge across different sectors. Most 
programmes are taught in English and encour- 
age foreign students to apply. After the course, 
job prospects are good: most graduates of the 
Oldenburg programme go on to PhD research 
or take up positions in industry; others move 
on to careers in climate diplomacy or financial 
consulting, for instance, says Konrad Blum, an 
energy researcher at Oldenburg. 

Prospective students should look into the 
curricula of the different programmes to see 
whether they provide qualifications relevant 
to the student's desired career path, says Theo 
Buehler, a career expert at the Wissenschaft- 
sladen Bonn, a private ‘science shop’ that pro- 
vides advice on the job market and qualification 
requirements in energy and environment sec- 
tors. To capitalize on the looming personnel 
shortage, the science shop recommends that 
students acquire solid core knowledge in a 
particular field, such as chemistry, before going 
on to specialize in renewable energy through 
a postgraduate programme. Language skills 
and flexibility are prerequisites, and business 
skills are an increasingly sought-after extra, 
says Buehler. 

Bar knows that his own students and post- 
docs at the HZB are not immune to the lure of 
industry. “The best ones are the first to leave,” 
he says. “So you've got to make sure that there 
is always enough fresh talent around. But look- 
ing at the many Asian, Middle Eastern and 
Latin American faces in my lectures, I'm not 
at all concerned that the well might run dry.’ m 


Quirin Schiermeier is Nature’ Germany 
correspondent. Katrin Kohnert is a former 
Nature intern, based in Munich. 
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Unleash your 
inner dummy 


There is something to be said for letting go of the 
mantle of expert, argues Peter Fiske. 


valued above all else. Scientists spend 

years in graduate programmes, studying, 
teaching and researching, to become experts 
in their fields. Having invested so many years 
in developing our expertise, we naturally 
assume that it is the main thing that we can 
offer the world. Some assume that where we 
lack expertise, we cannot add any value. 

The structure and culture of research tends 
to reinforce this idea. In papers, scientists 
reference only the leading experts. It is next 
to impossible for someone who is not recog- 
nized to get funding — no matter how good 
their grant application. Fear of failure causes 
many of us to avoid environments in which 
we are not experts. There may be no such 
thing as a stupid question in primary school, 
but in graduate school one can be made to feel 
quite uneasy about asking a basic question. 

Ironically, always playing the expert can 
be limiting, in terms of both contributions 
to science and career options. Sometimes, 
playing the dummy can be liberating and 
help to reveal opportunities that would oth- 
erwise have been overlooked. Dummies ask 
questions that experts assume were answered 
long ago. Dummies explore subject areas in 
which they lack knowledge. Dummies listen 
more and talk less. 

Becoming a dummy frees you from dogma. 
Developing expertise can often mean ingest- 
ing unquestioned assumptions and accepted 
facts. Such received beliefs can lead to unchal- 
lenged group decision-making and prevent a 
community from recognizing a path-break- 
ing discovery — especially when it comes 
from someone outside the discipline. 

In fact, many exciting scientific discover- 
ies have arisen from a non-expert asking a 
basic question — consider, for example, the 
connection between the extinction of the 
dinosaurs and a meteorite impact, which was 
the brainchild of an experimental physicist, 
not a palaeontologist. Not coincidentally, 
experts ina field often meet these basic ques- 
tions with a fair degree of hostility. 

Embracing your inner dummy is also a 
powerful tool for communicating science. 
Many scientists assume that public hostil- 
ity toward science stems from ignorance of 


|: science, intellect and intelligence are 


the facts, and that science education is the 
remedy. But perhaps the answer is also more 
human engagement by scientists; sometimes 
they need to step out of the role of expert and 
seek to understand the audience's cultural 
and ethical perspectives. 

Scientists’ preoccupation with being an 
expert as they consider career options can 
be enormously limiting. Newly minted 
PhD holders can have a hard time seeing 
themselves as anything but researchers. The 
idea that they are experts in their subjects is 
familiar and comfortable. Less familiar is the 
notion that the transferable skills they have 
developed — such as analysis, research and 
model testing — may also make them adapt- 
able problem-solvers, able to contribute in 
a wide range of career environments. As a 
result, many PhD holders limit their ambi- 
tions to a job within their discipline. 

In reality, doing original scientific research 
in a graduate programme can prepare you 
for a variety of environments and roles. The 
years spent developing and testing a hypoth- 
esis, coping with limited resources and learn- 
ing to use an array of complicated tools for 
gathering and analysing scientific data are 
excellent preparation for being innovative, 
resourceful and effective wherever you work. 
By letting go of the need to be an expert, you 
can approach unfamiliar problems and rec- 
ognize the underlying connections with what 
you have already done. m 


Peter Fiske is chief executive of PAX Water 
Technologies in Richmond, California, 

and author of Put Your Science to Work 
(American Geophysical Union, 2001). 
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UNITED STATES 
Doctorate awards drop 


The number of science doctorates 
awarded in the United States has declined 
for the first time in eight years, according 
to a report by the US National Science 
Foundation (NSF). The NSF's annual 
survey, out in late November, revealed that 
US institutions conferred 25,820 science 
doctorates in 2009, but only 25,589 in 
2010, a decrease of 0.89%. Between 

2002 and 2009, the number climbed 

by 32%. However, 2009-10 saw small 
drops in agricultural sciences, molecular 
biology, neuroscience, Earth sciences and 
chemistry. Report author Mark Fiegener, 
a statistician at the NSF in Arlington, 
Virginia, says that the fall might have 
been caused by the uncertain economy, 
which is prompting science graduate 
students to stay in their programmes for 
longer, rather than claiming their degrees. 
“They may be staying in school until the 
economy looks better and they can get a 
position,” says Fiegener. 


UNITED KINGDOM 
Manchester hiring 


Despite austerity measures across Britain, 
the University of Manchester is creating 
more than 30 science faculty posts as 

part of a £20-million (US$31-million) 
recruitment campaign. It hopes to boost 
its performance in the Research Excellence 
Framework (REF) used to assess UK 
universities. The positions include 

23 professorships and 7 lectureships, and 
are part ofa move to establish more than 
100 posts across all fields. A spokesperson 
says that the scheme will put Manchester 
ahead of institutions that are cutting back, 
so that it will score well in the 2014 REF 
and receive more funding. Manchester has 
a £40-million budget surplus this year as a 
result of cost-saving measures. 


FUNDING 
Gift system for science 


An innovative website lets the public 
donate to scientific projects. SciFlies 
(http://sciflies.org), publicly launched on 
22 November, lists and promotes peer- 
reviewed projects, giving researchers 

6 months to reach funding targets. 
Creator David Fries, a marine scientist 

at the University of South Florida in St 
Petersburg, says that the site lets scientists 
raise money and build a community 

of supporters. Donations are free from 
university contract and overhead rates, says 
Fries, so researchers get a large share. 
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EATING WITH INTEGRITY 


BY DAVID BERREBY 


o, how was the big dinner party? 
A complete disaster! Geoff and I 


haven't spoken since... 

Not since Friday? 

That's right. You know, sometimes we joke 
about divorce when we're arguing, but this 
time... it doesn't feel safe to kid about that. 
It’s just so silent and grim. I think he thinks 
I’ve ruined his career. 

Look, I’m sure you did your best. It was 
a lot of pressure. All three of his new col- 
leagues, plus the Big Boss. I’m sure you did 
everything right. 

I thought I had. I checked everyone for 
allergies. Religious covenants. Lipid and 
corticosterone metabolism. Medica- 
tions, immunizations... 

Well, you can't be too careful. 

And I social-vetted the menu: eve- 
ryone checked off everything. And 
there was nothing to offend anyone. 
Tempeh-salmon, made by our own’bot, 
right here. All ingredients grown ster- 
ile, no contact with organic matter or 
pollutants ... 

You don't have to tell me. I know how 
careful we have to be. How careful you 
are. No one’s been sick from your place in 
years, and how many of us can say that? 

Thanks. I appreciate that, especially 
after what's happened. God, I don’t even 
want to tell. It’s just so mortifying. 

I think youd better, though. 

Sigh. Well, then, from the top. The 
table is set and the containers are all sealed 
and ready to be opened, the dining tents are 
over the chairs... 

You used the good ones? 

Of course! Pressurized, sterilized, totally 
safe, near-transparent. The ones we got 
for our tenth anniversary. So there I am, 
all ready and Geoff is wandering around 
resterilizing forks because everything's per- 
fect and he has nothing to do and then bing! 
In walks our first guest, who is also the Big 
Boss, and he’s carrying several things that 
look like food boxes and I think, uh-oh. 
Then I think, well, of course, those must be 
gifts, maybe a floral hologram or something, 
because it can’t be what I’m afraid it is. It 
just can't be! 

Well, how could you expect that? 

You know how this thing is spreading. 
But I didn’t update before the dinner, so I 
managed to miss the day when Mr Big Boss 
became one of them. 


A question of taste. 


We had three people at work do the same 
thing last week. It’s like some damn cult! 

“T hope it’s not a problem,” he says, with 
that hick accent, “but I’ve decided to Eat 
With Integrity.’ I mean, I about died. And 
the worst of itis, he’s the first guest. Everyone 
else is going to think we approve of this kind 
of thing. 

He’s also Geoff’s boss. You didn’t have 
much choice. 

I know, but I was just so... appalled. And 
disgusted. And I had no idea what to do, 
because we don’t have the equipment. 

You mean it wasn't cooked? 

Oh, no, of course it wasn’t cooked. Mr 
Big isn’t just a new-minted Integretist, he’s 


a hard-core Integretist. The meat has to be 
fresh as can be. He harvests himself, he cooks 
himself, it has to be done within the hour. 

You know, I just realized, in a way it’s ... 
well, it’s sort of like what our great-grand- 
parents ate, isn’t it? Kind of like... meat? 

Exactly the problem! It’s not tofurkey or 
FauxPorc, I can assure you. It’s red and bloody 
and it smells of iron and something else... 

I'm sorry, I think I'm going to be sick. 

You asked me to tell you! 

Oh, God. To think, in your own home! 

Right, so there he is, and he forceps out 
this ... lump of flesh, this oozing ... thing 
like it’s the Dark Ages and he looks at me 
cool as a cucumber and asks me if he can 
use the stove. And that’s when most of the 

others walked in. I 


> NATURE.COM don’t know what was 
FollowFutureson © worse — the disgust as 
Facebook at: they looked at us, or the 
go.nature.com/mteodm © pity. So, of course, into 
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this conversational vacuum leaps our newly 
converted Integretist. How he can talk! How 
hed been thinking about Eating With Integ- 
rity for years, how much sense it made, what 
with all the contaminants and the epidemics, 
you couldn't be too careful, blah blah blah. 
How it was all well and good to grow your 
own foods and use sterile hydroponics but 
you couldn't be too sure, and anyway, we 
evolved to eat meat, and the fact that there 
are 10 billion of us on Earth doesn’t change 
our basic anatomy, yak yak yak. Stem-cell 
technology is perfected now, and you can 
grow whatever you like, they have scaffolds 
for anything, muscle, liver, sweetbreads... 

Oh, my. No one spoke up? 

He’s the boss! No one dared. And I 
think they were all just stunned to see that 
this movement's spreading into their own 
lives, into their own office. It’s not just for 
recovering plague victims and famine 
survivors any more. 

You know, I saw something the other 
night about that. The Integretists are 
getting up to 20% in some cities. Once it 
costs nothing to clone yourself, a certain 
kind of person thinks, well, why not? 

Meanwhile the stove is doing what it 
can, trying to cook this ... this Jump. So it 
starts to... smell, you know? And a cou- 
ple of the more sensitive people, Geoff's 
brother, for one, they excuse themselves. 
And that sets off our guest. 

It's the safest meat there is, he says. Peo- 
ple didn't have to agree, but they shouldn't 
feel free to be downright rude. 

Bit of a buzz-kill, eh? 

... So then people force themselves to sit 
down, and they’re tented up, and their con- 
tainers unseal, but... God, I can still see it! 
No one eats a thing. We're all just too dis- 
gusted and perplexed, so the only sound at 
the table is my one happy guest, Eating With 
Integrity, happily chewing away on what he 
keeps insisting is the only 100% safe and 
healthy food left: his own flesh, cloned and 
grown in a vat next to his coffee machine. 
I can still see him there, having harvested 
himself and cooked himself, now eating 
himself; holding up his fork to our averted 
eyes, smiling and saying, now this, this is Eat- 
ing With Integrity! m 


A science writer, David Berreby writes the 
Mind Matters blog at bigthink.com/blogs/ 
Mind-Matters and is the author of Us and 
Them: The Science of Identity. He lives in 
Brooklyn, New York. 
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GSK-3a/B kinases and amyloid production in vivo 


ARISING FROM C. J. Phiel, C. A. Wilson, V. M.-Y. Lee & P. S. Klein Nature 423, 435-439 (2003) 


A major unresolved issue in Alzheimer’s disease is identifying the 
mechanisms that regulate proteolytic processing of amyloid precursor 
protein (APP)—glycogen synthase kinase-3 (GSK-3) isozymes are 
thought to be important in this regulation. Phiel et al.’ proposed that 
GSK-3a, but not GSK-3, controls production of amyloid’. We analysed 
the proteolytic processing of mouse and human APP in mouse brain 
in vivo in five different genetic and viral models. Our data do not yield 
evidence for either GSK-3a-mediated or GSK-3f-mediated control of 
APP processing in brain in vivo. 

GSK-3f is believed to be central to the pathogenesis of Alzheimer’s 
disease, linking amyloid and tau pathology” >. Unlike GSK-38, neither 
physiological functions nor pathological roles of GSK-3« are well 
explored’. To analyse the function of both GSK-3 kinases in brain, 
we generated mice that were completely deficient in GSK-30 (denoted 
as Gsk3a“°) as well as mice with neuron-specific GSK-3a or GSK-3B 
deficiency using Cre/loxP as for presenilin 1 (ref. 7). 

The contribution of GSK-3« to the processing of APP was analysed 
by Phiel et al.’ by silencing with short interfering RNA in cells and by 
pharmaceutical inhibition with Li* ions (an inhibitor of both GSK-3 
isozymes). We approached this problem genetically by generating two 
strains of GSK-30-deficient mice, denoted as Gsk3a*© and Gsk3a"/~ 
(complete and neuron-specific GSK-3« deficiency, respectively). In 
Gsk3a*®° mice, the gene was inactivated completely in all tissues, 
demonstrated by genotyping, immunohistochemistry and western 
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Figure 1 | APP processing in the brains of Gsk3a"~'~ and Gsk3a*° mice. 
a, Biochemical analysis of murine APP metabolites in total brain homogenates 
by western blotting. Mouse amyloid peptides were measured by specific ELISA. 
APPm, membrane-bound APP; APPs, secreted APP; Cre, Cre recombinase; 
CTF, C-terminal fragments; floxed, recombinant genes with inserted loxP 
elements’. b, Biochemical analysis of hippocampi at 3 weeks after intracerebral 
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blotting (data not shown). In the brains of Gsk3a"‘~ mice, some 
residual activity was expected because GSK-3 enzymes are expressed 
in non-neuronal cell types in the CNS** (Fig. 1). GSK-3a deficiency 
had no evident impact on GSK-3f levels, demonstrating their inde- 
pendent regulation (Fig. 1). 

First, we analysed biochemically the proteolytic processing of endo- 
genous mouse APP in brains of adult Gsk3a" ‘~ mice by measuring 
the APP metabolites in total brain extracts by validated methods*”?""’. 
Neuronal deficiency of GSK-3« did not affect levels of immature and 
mature APP, APP phosphorylated at T668 (pT668), secreted ectodo- 
main APP, nor of carboxy-terminal fragments (Fig. la). Levels of 
mouse amyloid peptides in the brain, measured by specific enzyme- 
linked immunosorbent assay (ELISA), were unaffected in Gsk3a"/~ 
mice relative to age- and gender-matched control mice with floxed 
Gsk3a genes but not expressing Cre recombinase (Fig. 1a). 

We went on to administer AAV-APP.SLA viral vectors (adeno-assoc- 
iated viral vector expressing triple mutant APP with Swedish, London 
and Austrian mutations) by intra-hippocampal injection into Gsk3a"/~ 
mice, to analyse the processing of human mutant APP.SLA (ref. 9). At 
3 weeks after injection, biochemical analysis of hippocampal extracts of 
AAV-APP.SLA-injected Gsk3a" /~ mice revealed no significant differ- 
ences for any of the APP metabolites (Fig. 1b). 

We continued to analyse mice with a complete deficiency in GSK- 
30 generated serendipitously during the expansion of the Gsk3a"‘~ 
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injection of AAV-APP.SLA. Human amyloid peptides were measured by 
specific ELISA’. c, Analysis of murine APP metabolites in the brains of Gsk3a*° 
mice and of human APP metabolites in Gsk3a“°x App(V7171) bigenic mice, 
heterozygous (+/—) or homozygous (—/—) for GSK-3« deficiency. All data are 
mean + s.e.m. 
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colony (data not shown). In addition, we generated bigenic mice by 
crossing the Gsk3a“° mice with our App(V7171) mice, a validated 
model for amyloid pathology*”’"*. We carried out a biochemical 
analysis of metabolites of mouse APP and human mutant APP in 
brains of Gsk3a“° mice and Gsk3a“°XApp(V717I) bigenic mice, 
respectively. None of the APP metabolites derived from either mouse 
or human APP was affected by the complete deficiency of GSK-3a 
(Fig. Ic). 

We subsequently extended the study to the GSK-3B isozyme by 
generating Gsk3b" ‘~ mice with a neuron-specific deficiency of GSK- 
3B. GSK-3B deficiency had no evident impact on the expression of the 
GSK-3a isozyme, further corroborating their independent regulation 
(Fig. 2). Notably, no significant deviation was noted in the brain levels 
of mouse APP metabolites in Gsk3b" ‘~ mice (Fig. 2). 

In addition, we attempted to generate mice deficient in both GSK-3 
isozymes in their central neurons. The combination of mice contain- 
ing both floxed Gsk3 genes and the Thyl-Cre recombinase transgene 
yielded 354 offspring over a time span of 16 months that were all 
genotyped for the five genes of interest: wild-type and floxed GSK-3 
isozymes and Thyl-Cre. None of the pups contained the wanted 
combination with both Gsk3 genes recombined homozygously. The 
outcome proved, not unexpectedly, that complete deficiency of GSK- 
3 activity in central neurons is lethal for mice. 

We conclude that the GSK-3 isozymes do not contribute signifi- 
cantly to the processing of APP in mouse brain in vivo. Of interest, the 
combination of Gsk3b"-/~ with App(V717I) did not yield viable 
offspring (data not shown), in contrast to viable offspring yielded 
by the Gsk3a*°x App( V7171) combination (Fig. 1c). This underlines 
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Figure 2 | APP processing in the brains of Gsk3b"‘~ mice. a, Western blot 
of GSK-3 in brain homogenates of mice with floxed Gsk3b genes, with and 
without expression of Cre recombinase. b, c, Biochemical analysis of proteolytic 
processing of mouse APP in the brains of Gsk3b" ‘~ mice. All data are 
mean + s.e.m. The y axes of graphs in a—c show relative percentage. BAPPs, 
B-secretase cleaved secreted APP; tAPPs, total secreted APP; WO-2, 
monoclonal antibody specific for the amyloid sequence in APP and amyloid 
peptides; 22C11, monoclonal antibodies specific for N-terminus of APP. 


the substantial functional difference of the GSK-3 isozymes in brain; a 
difference that we believe is not, however, related to APP processing. 
The combined outcome substantiates the notion that GSK-3B 
is physiologically and pathologically the dominant isozyme, 
notwithstanding the fact that both are activated by amyloid’. The 
latter point testifies that both kinases contribute, downstream of 
APP, to tauopathy and cognitive defects in bigenic and viral mouse 
models**’** (data not shown) and by extrapolation in human 
disease?*?. 
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Phiel et al. reply 


REPLYING TO T. Jaworski et a/. Nature 480, doi:10.1038/nature10615 (2011) 


GSK-3 has been implicated in the pathogenesis of Alzheimer’s disease 
through regulation of tau phosphorylation, cellular responses to 
amyloid-B and processing of amyloid precursor protein (APP) to 
amyloid-B. We previously reported’ that inhibition of GSK-3 reduces 
the accumulation of AByp and ABy (amyloid-& peptides of 40 and 42 
amino acids) in mouse brain and cell culture and that knockdown of 
Gsk3a reduces amyloid-f accumulation in CHO cells, indicating that 
Gsk3a contributes to the processing of APP in this context’. At about 
the same time, others reported that knockdown of Gsk3b reduces 
amyloid-f production and overexpression of active Gsk3b enhances 
amyloid-f production” ~*. A reasonable explanation for these findings, 
as suggested previously***, was that both of these highly similar 
enzymes contribute to APP processing and their respective contribu- 
tions depend on cell type, relative abundance and assay conditions. 
Jaworski et al.’ now show that APP can be processed in mouse brain in 
the absence of either Gsk3a or neuronal Gsk3b. 

However, Gsk3a and Gsk3b are highly homologous and frequently 
redundant genes*”. For this reason, it is not possible to exclude a role 
for GSK-3 with single gene knockouts. Furthermore, structurally and 
mechanistically diverse GSK-3 inhibitors reduce amyloid-B accu- 
mulation in vivo and in cell culture, including lithium, kenpaullone, 
bisindolylmaleimide I, FRAT peptide, the TDZD-related NP12 and 
kinase-dead GSK-3 (refs 1-6, 10, 11). These compounds inhibit both 
GSK-3a and GSK-3 and therefore get around the issue of redund- 
ancy. Although off-target effects for these diverse GSK-3 inhibitors 
could, formally, explain their effects on APP processing, a far more 
plausible explanation is that these compounds act through GSK-3 
inhibition. 

Jaworski et al.’ show that APP can be processed in mouse brain in 
the absence of either Gsk3a or neuronal Gsk3b, but do not examine 
redundant functions of Gsk3a and Gsk3b in APP processing and do 
not consider the substantial body of work showing that acute inhibi- 
tion of GSK-3 reduces amyloid-B accumulation in vivo. The simplest 
explanation for these findings**® is that both Gsk3a and Gsk3b can 
contribute to APP processing and that inhibition of GSK-3 reduces 
amyloid-B in Alzheimer’s disease models. 
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GSK-3a/B kinases and amyloid production in vivo 


ARISING FROM C. J. Phiel, C. A. Wilson, V. M.-Y. Lee & P. S. Klein Nature 423, 435-439 (2003) 


A major unresolved issue in Alzheimer’s disease is identifying the 
mechanisms that regulate proteolytic processing of amyloid precursor 
protein (APP)—glycogen synthase kinase-3 (GSK-3) isozymes are 
thought to be important in this regulation. Phiel et al.’ proposed that 
GSK-3a, but not GSK-3, controls production of amyloid’. We analysed 
the proteolytic processing of mouse and human APP in mouse brain 
in vivo in five different genetic and viral models. Our data do not yield 
evidence for either GSK-3a-mediated or GSK-3f-mediated control of 
APP processing in brain in vivo. 

GSK-3f is believed to be central to the pathogenesis of Alzheimer’s 
disease, linking amyloid and tau pathology” >. Unlike GSK-38, neither 
physiological functions nor pathological roles of GSK-3« are well 
explored’. To analyse the function of both GSK-3 kinases in brain, 
we generated mice that were completely deficient in GSK-30 (denoted 
as Gsk3a“°) as well as mice with neuron-specific GSK-3a or GSK-3B 
deficiency using Cre/loxP as for presenilin 1 (ref. 7). 

The contribution of GSK-3« to the processing of APP was analysed 
by Phiel et al.’ by silencing with short interfering RNA in cells and by 
pharmaceutical inhibition with Li* ions (an inhibitor of both GSK-3 
isozymes). We approached this problem genetically by generating two 
strains of GSK-30-deficient mice, denoted as Gsk3a*© and Gsk3a"/~ 
(complete and neuron-specific GSK-3« deficiency, respectively). In 
Gsk3a*®° mice, the gene was inactivated completely in all tissues, 
demonstrated by genotyping, immunohistochemistry and western 
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Figure 1 | APP processing in the brains of Gsk3a"~'~ and Gsk3a*° mice. 
a, Biochemical analysis of murine APP metabolites in total brain homogenates 
by western blotting. Mouse amyloid peptides were measured by specific ELISA. 
APPm, membrane-bound APP; APPs, secreted APP; Cre, Cre recombinase; 
CTF, C-terminal fragments; floxed, recombinant genes with inserted loxP 
elements’. b, Biochemical analysis of hippocampi at 3 weeks after intracerebral 
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blotting (data not shown). In the brains of Gsk3a"‘~ mice, some 
residual activity was expected because GSK-3 enzymes are expressed 
in non-neuronal cell types in the CNS** (Fig. 1). GSK-3a deficiency 
had no evident impact on GSK-3f levels, demonstrating their inde- 
pendent regulation (Fig. 1). 

First, we analysed biochemically the proteolytic processing of endo- 
genous mouse APP in brains of adult Gsk3a" ‘~ mice by measuring 
the APP metabolites in total brain extracts by validated methods*”?""’. 
Neuronal deficiency of GSK-3« did not affect levels of immature and 
mature APP, APP phosphorylated at T668 (pT668), secreted ectodo- 
main APP, nor of carboxy-terminal fragments (Fig. la). Levels of 
mouse amyloid peptides in the brain, measured by specific enzyme- 
linked immunosorbent assay (ELISA), were unaffected in Gsk3a"/~ 
mice relative to age- and gender-matched control mice with floxed 
Gsk3a genes but not expressing Cre recombinase (Fig. 1a). 

We went on to administer AAV-APP.SLA viral vectors (adeno-assoc- 
iated viral vector expressing triple mutant APP with Swedish, London 
and Austrian mutations) by intra-hippocampal injection into Gsk3a"/~ 
mice, to analyse the processing of human mutant APP.SLA (ref. 9). At 
3 weeks after injection, biochemical analysis of hippocampal extracts of 
AAV-APP.SLA-injected Gsk3a" /~ mice revealed no significant differ- 
ences for any of the APP metabolites (Fig. 1b). 

We continued to analyse mice with a complete deficiency in GSK- 
30 generated serendipitously during the expansion of the Gsk3a"‘~ 
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injection of AAV-APP.SLA. Human amyloid peptides were measured by 
specific ELISA’. c, Analysis of murine APP metabolites in the brains of Gsk3a*° 
mice and of human APP metabolites in Gsk3a“°x App(V7171) bigenic mice, 
heterozygous (+/—) or homozygous (—/—) for GSK-3« deficiency. All data are 
mean + s.e.m. 
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colony (data not shown). In addition, we generated bigenic mice by 
crossing the Gsk3a“° mice with our App(V7171) mice, a validated 
model for amyloid pathology*”’"*. We carried out a biochemical 
analysis of metabolites of mouse APP and human mutant APP in 
brains of Gsk3a“° mice and Gsk3a“°XApp(V717I) bigenic mice, 
respectively. None of the APP metabolites derived from either mouse 
or human APP was affected by the complete deficiency of GSK-3a 
(Fig. Ic). 

We subsequently extended the study to the GSK-3B isozyme by 
generating Gsk3b" ‘~ mice with a neuron-specific deficiency of GSK- 
3B. GSK-3B deficiency had no evident impact on the expression of the 
GSK-3a isozyme, further corroborating their independent regulation 
(Fig. 2). Notably, no significant deviation was noted in the brain levels 
of mouse APP metabolites in Gsk3b" ‘~ mice (Fig. 2). 

In addition, we attempted to generate mice deficient in both GSK-3 
isozymes in their central neurons. The combination of mice contain- 
ing both floxed Gsk3 genes and the Thyl-Cre recombinase transgene 
yielded 354 offspring over a time span of 16 months that were all 
genotyped for the five genes of interest: wild-type and floxed GSK-3 
isozymes and Thyl-Cre. None of the pups contained the wanted 
combination with both Gsk3 genes recombined homozygously. The 
outcome proved, not unexpectedly, that complete deficiency of GSK- 
3 activity in central neurons is lethal for mice. 

We conclude that the GSK-3 isozymes do not contribute signifi- 
cantly to the processing of APP in mouse brain in vivo. Of interest, the 
combination of Gsk3b"-/~ with App(V717I) did not yield viable 
offspring (data not shown), in contrast to viable offspring yielded 
by the Gsk3a*°x App( V7171) combination (Fig. 1c). This underlines 
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Figure 2 | APP processing in the brains of Gsk3b"‘~ mice. a, Western blot 
of GSK-3 in brain homogenates of mice with floxed Gsk3b genes, with and 
without expression of Cre recombinase. b, c, Biochemical analysis of proteolytic 
processing of mouse APP in the brains of Gsk3b" ‘~ mice. All data are 
mean + s.e.m. The y axes of graphs in a—c show relative percentage. BAPPs, 
B-secretase cleaved secreted APP; tAPPs, total secreted APP; WO-2, 
monoclonal antibody specific for the amyloid sequence in APP and amyloid 
peptides; 22C11, monoclonal antibodies specific for N-terminus of APP. 


the substantial functional difference of the GSK-3 isozymes in brain; a 
difference that we believe is not, however, related to APP processing. 
The combined outcome substantiates the notion that GSK-3B 
is physiologically and pathologically the dominant isozyme, 
notwithstanding the fact that both are activated by amyloid’. The 
latter point testifies that both kinases contribute, downstream of 
APP, to tauopathy and cognitive defects in bigenic and viral mouse 
models**’** (data not shown) and by extrapolation in human 
disease?*?. 
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the same time, others reported that knockdown of Gsk3b reduces 
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as suggested previously***, was that both of these highly similar 
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could, formally, explain their effects on APP processing, a far more 
plausible explanation is that these compounds act through GSK-3 
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Jaworski et al.’ show that APP can be processed in mouse brain in 
the absence of either Gsk3a or neuronal Gsk3b, but do not examine 
redundant functions of Gsk3a and Gsk3b in APP processing and do 
not consider the substantial body of work showing that acute inhibi- 
tion of GSK-3 reduces amyloid-B accumulation in vivo. The simplest 
explanation for these findings**® is that both Gsk3a and Gsk3b can 
contribute to APP processing and that inhibition of GSK-3 reduces 
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The 2009 pandemic arrived suddenly and lethally, exposing 
our plans to reality. Are we now better prepared? 


BY LAURA VARGAS PARADA 


dela Gutiérrez, a door-to-door census- 
A“ for the Mexican government, started 
eeling unwell with fever, headache and 
a sore throat, in early April 2009. Gutiérrez, a 
39-year-old mother of three, lived in the southern 
state of Oaxaca. By the time she arrived at the state 
hospital, she was unable to breathe and was diag- 
nosed with severe atypical pneumonia that might 
have been complicated by her diabetes. Within 
four days, Gutiérrez was dead. 

Lab tests at the Oaxaca state hospital could 
not confirm the cause of death, so throat swabs 
were sent to the National Microbiology Labora- 
tory in Winnipeg, Canada, and to the US Cent- 
ers for Disease Control and Prevention (CDC) 
in Atlanta, Georgia. Ten days later, Cruz would 
become one of the first confirmed fatalities ofa 
new influenza A (H1N1) virus, which originated 
in pigs. By then, nearly half of all the people who 
were in contact with Cruz in the hospital had 
developed respiratory symptoms and one preg- 
nant nurse had fallen ill. Two months later, the 
World Health Organization (WHO) declared a 
global health emergency. 

Early observations showed that this strain of 
flu disproportionately afflicted young people. 
A study of patients in Mexico found that the 


median age of fatalities was 39 years (ref. 1). Epi- 
demiologists feared this outbreak could resem- 
ble the infamous 1918 influenza pandemic, 
also an H1NI strain, which killed an estimated 
50 million people (around 3% of the then global 
population) — half of whom were healthy adults 
between the ages of 20 years and 40 years old. The 
US President’s Council of Advisors on Science 
and Technology calculated a possible scenario of 
30,000-90,000 deaths in the United States alone. 
Fortunately, the transmission rate and virulence 
of the virus could not produce the pandemic that 
was initially dreaded. 


GAINING MORE EXPERIENCE 
Nevertheless, the 2009 H1N1 pandemic proved 
to bea thorough test of the 2005 International 
Health Regulations (IHR), designed to be the 
world’ first line of defence during public-health 
emergencies. (IHR is a legal agreement that dates 
back to 1969 and is binding by 194 party states, 
including all WHO members.) The 2009 out- 
break was the first major assessment of the new 
THR and raised critical questions about why it 
was so difficult to determine the severity of the 
threat, how effective the preparations were, and 
whether the crisis was properly managed. 

In late 2009, director-general Margaret 
Chan of the WHO advised a review of the 
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pandemic to learn from any lessons and eval- 
uate whether the IHR fulfilled its purpose. 
The WHO assigned this task to a newly cre- 
ated IHR review committee, comprised of 25 
international experts from diverse scientific 
fields with experience in public health. The 
committee presented its final report in May 
2011, during the 64" World Health Assembly 
in Geneva, Switzerland. 

The emergence of human infections of avian 
influenza A (H5N1) in 1997 and the corona- 
virus causing severe acute respiratory syndrome 
(SARS) in 2002 expedited international prepara- 
tions for a pandemic. The IHR was updated in 
2005 to respond to these new threats and entered 
into force worldwide in 2007, and are expected 
to be fully operational by 2012. 

A major point of concern raised during the 
outbreaks of bird flu and SARS was the perfor- 
mance of disease warning systems in place. To 
assure a two-way channel of communication 
between the WHO and the party states, the IHR 
required each country to establish, by 2012, what 
it calls a focal point — offices that liaise with the 
WHO at all times and deploy resources for dis- 
ease surveillance, early warning systems and the 
response to a disease outbreak. 

The IHR review committee reported the 
THR was still “not yet fully operational” world- 
wide. “There are still many states that have not 
developed the plans and infrastructure speci- 
fied in the IHR,” says José Ignacio Santos, who 
heads the infectious disease unit of the school 
of medicine at the National Autonomous Uni- 
versity of Mexico in Mexico City and a member 
of the IHR review committee. Although three 
quarters of the 194 party states had plans to co- 
ordinate national efforts in the case of a pan- 
demic influenza outbreak, that statistic didn’t 
tell the whole story. Only a dozen of the 
128 countries that answered a WHO question- 
naire had taken the steps required to put sucha 
plan in practice, including: enacting legislation; 
allocating funding; putting enough people in 
place for detection and alert operations; and 
establishing procedures for surveillance, event 
detection, risk assessment and information 
provision’. In addition, many National IHR 
focal points are unable to communicate infor- 
mation about public health emergencies to the 
WHO inatimely fashion. 

According to the IHR review committee “the 
THR played a central role in the global response 
to the pandemic: In Mexico, the IHR probably 
saved lives thanks to the progress made towards 
establishing plans and stocking antiviral medica- 
tions. But what would happen ifa new virus were 
to appear in countries that are not yet prepared, 
such as some in Africa and Asia? 

The WHO did not escape blame in the 
IHR review committee's assessment. For 
instance, the WHO was inconsistent in its defi- 
nition of ‘pandemic. Some WHO documents 
described pandemics in terms of the number of 
deaths and illness caused by the disease, whereas 
the latest WHO definition is based on the degree 
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of geographical spread. The WHO also failed to 
come up with consistent measures to assess the 
severity of an outbreak. The review committee 
stressed the need for the WHO to strengthen 
its capacity to mounta sustained response, and 
to improve its communication policies — in 
particular by issuing timely guidance in the 
organization's six official languages and hold- 
ing routine press conferences. The review com- 
mittee’s report concluded with a stark warning: 
“The world is ill-prepared to respond to any 
similarly global, sustained and threatening 
public-health emergency.” 


SHARPENING PREDICTIONS 

Despite recent efforts to improve preparedness 
for a global influenza pandemic, there are still 
many things we dont understand about the virus. 
“Influenza is the name of one disease but is caused 
bya family of viruses; says Sylvie Briand, head of 
the global influenza programme at the WHO in 
Geneva, Switzerland. “We don't know very much 
about how these viruses circulate at the global level” 
Itis still an enigma as to when, and to what extent, 
influenza viruses change their genetic make-up, and 
how they spread around the globe. 

A reminder of how influenza can still surprise us 
came on 2 September, 2011, whena newswine influ- 
enza virus A (H3N2) wasisolated from two children 
in the United States — one in Indiana and one in 
Pennsylvania’. Human H3 and H2 influenza strains 
were transmitted to pigs in the 1990s. Those viruses 
evolved in swine and now differ from the seasonal 
influenza that circulates in humans. People born 
before the 1990s “still have antibodies to the swine 
viruses because their immune system was exposed to 
the original viruses; says Nancy Cox, director of the 
influenza division at the CDC. “But young kids are 
very susceptible” The CDC has identified a vaccine 
candidate against this virus — just in case, says Cox. 


Camp Funston, Kansas, 
site of the first recorded 
cases of Spanish flu (left), 
nurse draws water for flu 


RETR 


the epidemic (right). 


In an ominous report published in 2011, 
researchers studied H2N2 viruses that have not 
circulated in human populations for several 
decades, but are still common infections of birds 
and swine’. The study found that people under 
50 years of age have little or no immunity to this 
strain of the virus, reigniting fear that an H2N2 
pandemic could, like H1N1, jump from animals 
to humans. 

The 2009 swine flu was a reassortant virus 
that crossed from pigs to humans (a reassortant 
virus contains genetic material from two or more 
related viruses). Cox believes that this exchange 
between human and porcine viruses is more 
common than had been previously thought: 
“This is probably going on ina lot of other coun- 
tries. In Switzerland they have detected swine 
infections in humans,’ Cox says. In a retrospec- 
tive study published in 2007, researchers found 
evidence of 50 apparent cases of influenza that 
were transmitted from pigs to humans. Most of 
the cases were reported in the United States — 
probably, Cox says, because the US has a very 
good surveillance system, but there were also six 
cases in the Czech Republic, four in the Nether- 
lands, three in Russia, three in Switzerland, and 
one case each in Canada and Hong Kong”. 

Prior to 2009, three global networks were in 
place for the early detection and/or surveillance 
of influenza: the 2005 IHR, the WHO’s Global 
Influenza Surveillance Network (GISN) and sys- 
tematic event detection (that monitors informa- 
tion published not only formally in journals but 
also in newspapers and online forums). In March 
2009, just weeks before the swine flu outbreak, 
more than half of WHO member states (104 of 
193 countries) had no or very limited seasonal 
influenza surveillance capacity, according to a 
2011 article co-authored by Briand’. (Both the 
United States and Mexico had good surveillance 
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systems, which is why the new virus was identi- 
fied in those countries early on.) The same report, 
echoing some of the themes articulated by the 
THR review committee report, identified other 
deficiencies that were revealed during the H1N1 
pandemic. Among them were a lack of standards 
for reporting illness, risk factors and mortality. As 
a result, different countries used different crite- 
ria to define influenza-like illness and collected 
information on different risk factors. This incon- 
sistency makes it difficult to do the sort of com- 
parisons that would help public health officials 
assess how well they are doing relative to other 
places in the world. After the swine flu pandemic, 
the WHO worked to better define standards and 
develop a web-based platform called FLuID (Flu 
Informed Decisions) for direct reporting of epi- 
demiological data, such as intensity of influenza 
transmission, number of hospitalizations, and 
risk factors for severe disease. FLuID is similar 
to the WHOS FluNet surveillance system, which 
collects information concerning the viral type 
and subtype. 

Scientists are confident that a good surveillance 
system, along with the GISN monitoring antibod- 
ies against the virus, will help detect new viruses 
before they becomea threat to global health. And 
if this is the case, says Briand, “we have a protocol 
for rapid containment [if] we detect an outbreak 
early enough to contain it at its source.” 


NEW DIRECTIONS 
One of the main challenges during an outbreak is 
to assess the severity of the pandemic disease. Even 
with today’s technology, there are a few critical days 
of uncertainty after the outbreak is identified, but 
before enough epidemiological and clinical data 
are gathered to allow public health officials to plot 
the most effective response. 

“In the 2009 pandemic, some decisions had 


CORBIS & LIFE PICTURES/GETTY IMAGES, BETTMANN/CORBIS, BETTMANN/CORBIS, CDC/SCIENCE PHOTO LIBRARY, KPA/ZUMA/REX FEATURES. 


FROM LEFT: NATIONAL MUSEUM OF HEALTH AND MEDICINE/SCIENCE PHOTO LIBRARY; NATIONAL ARCHIVES/SCIENCE PHOTO LIBRARY; TIME, BETTMANN/ 


| 


oo 


' Catch Something Made in the U.S.A. 


1968 


700,000 people die from H3N2 infection 


(descended from H2N2); originates in Hong Kong. 


5 


40NG KONG FLU 
HONG OT AN! ! 


1970 


1980 


to be taken before we hada clear picture? Briand 
says. Drawing on that experience, she says, health 
officials are trying to improve their ability to assess 
an outbreak at its earliest stages. “If we isolate the 
virus,’ she explains, “we can determine how dif- 
ferentitis from the current circulating viruses and 
if it is susceptible to available antivirals.” Testing 
blood samples for antibodies to influenza viruses 
will allow researchers to determine how many 
people have been exposed to the virus and how 
prevalence fluctuates over time. 

The absence of important data can be over- 
come. During the 2009 pandemic, determining 
the fatality rate was sometimes difficult because 
information about the number of persons infected 
and the number of deaths was incomplete. Exacer- 
bating the problem, the WHO, as well as national 
authorities, overwhelmed public health officers 
with requests for specific data, such as the number 
of cases. In countries with limited epidemiological 
and laboratory infrastructure, the need to deliver 
so many statistics led to an unfortunate diversion 
of personnel away from patient care. And in coun- 
tries during the early stages of the pandemic, this 
information proved to be less useful for assessing 
severity than rates of hospitalization and complica- 
tions, and actual number of deaths. 

As it turned out, however, other parameters 
proved to be good proxies to gauge the severity 
of disease. Measures included the proportion of 
cases that required hospitalization for treatment 
or that required intensive care and mechanical 
ventilation. Other useful data were the proportion 
of previously healthy individuals without under 
lying risk factors that developed severe disease. And 
as always, the severity of a pandemic will depend 
on conditions inside a country, such as access to 
health services, general health of the population, 
and various social and behavioral factors. 

As the influenza virus continues to circulate 
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in human and animal populations across the 
globe, it is crucial to identify changes in the viral 
genome — especially those mutations that con- 
fer resistance to antiviral drugs. In 2009, Briand 
says, “we were fortunate because the virus was 
sensitive to neuraminidase inhibitors that we had 
in our stockpile.” Neuraminidase is a viral sur- 
face protein that allows the virus to spread from 
infected cells to other cells (see “Lines of defence; 
page S9). But any strategic stockpiling of anti-flu 
drugs needs to account for the limited shelflives 
of the medications. At the WHO, public health 
officers are looking for technical solutions to 
prolong the usefulness of the antivirals to at least 
two or three more years beyond the roughly 
5 year shelf lives of today’s medications, as well as 
“encouraging the development of new antivirals’, 
says IHR review committee member Santos. 
The best weapon against flu is still the vac- 
cine. Once the HIN1 2009 virus was identified, 
the WHO needed just one month to select the 
pandemic virus and develop the seed strains for 
vaccine development. But that's only the first step. 
“Tt takes at least six months to produce significant 
quantities of a vaccine,’ Klaus Stohr of Novartis 
claimed in 2010 (ref. 7). Stohr is vice president 
of influenza strategy at Novartis Vaccines and 
Diagnostics in Cambridge, Massachusetts. One 
vexing question is how much vaccine to produce. 
By the time vaccines against the 2009 HIN] virus 
were made available, the production was enough 
to treat only 10% of the population worldwide. 
Santos also highlights pharmaceutical deficits. 
“The technology to produce influenza vaccine has 
not changed much in the past 60 years,’ he says. 
Even if vaccines are produced in abundance, 
there's no guarantee that the medication will get 
to the people who need it. “Vaccine distribution 
worldwide is the other challenge,” says Santos. 
Access to vaccines, he says, can be improved by 
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legal agreements between manufacturers and 
countries. Other paths to more equitable access to 
vaccines include differential pricing so that poorer 
countries can get the vaccines cheaper — already 
in practice, but says Santos, needs to be made 
more universal — as well as vaccine donations by 
manufacturers to the WHO. At the national level, 
there is the need to assure the maintenance of the 
cold chain — the continuous sequence of properly 
refrigerated vessels to transport the vaccines from 
factory to the patient — and to plan for local vac- 
cine distribution and administration. “One way to 
facilitate both vaccine and antiviral distribution,’ 
Santos says, “could be to regionalize the effort, giv- 
ing each WHO regional office the responsibility to 
secure the resources needed for the area.” 

No one can predict with precision when and 
where the next global outbreak will arise. But, 
in view of what has been learned from recent 
influenza pandemics, it is possible to define the 
most likely scenarios and to prepare for them. “It 
is like your personal life,’ says Briand. “You can- 
not predict anything but there are scenarios that 
are more likely than others.” The lessons learned 
from the latest pandemic “will certainly drive our 
preparedness to different directions now. 

One key lesson is the need to be flexible in our 
response to unexpected conditions and to be able 
to respond in times of uncertainty. We must, in 
other words, be like the flu virus itself. m 


Laura Vargas Parada is a freelance science 
writer in Mexico City. 
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Some people get horribly sick from the flu, and even die. 


Others just rest for a few days. What’s behind this 
fateful variation? 


BY CHRISTINE JUNGE 
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our spouse had the flu last week and 
Ye barely get out of bed for five 

full days — she was feverish and her 
aching body exhausted. Then your four-year- 
old became sick and you had to rush her to 
the hospital when her fever spiked. Now that 
you ve come down with the virus, you're tired, 
a bit achy, but that only lasts for two or three 
days. How is it that the flu virus can have sucha 
variable impact on people? 

In order for the flu to pose a danger to any 
individual, two major factors must align: the 
host person has to be a clean slate — someone 
who hasn't been exposed to an influenza virus 
related to the one circulating; and the virus 
itself has to be especially potent. The range 
and severity of symptoms the host experiences 
depend on the answers to a few other questions, 
such as whether the person’s immune system 
is compromised, and whether some genetic 
vulnerability exists. 

The first question to delve into when 
asking about why a particular person gets very 
sick is: what makes a particular strain of flu 
more likely to take hold in a human host? The 
answer lies in the basic structure of the patho- 
gen. The influenza virus always consists of two 
types of major proteins: one hemagglutinin 
(H) and one neuraminidase (N). There are at 
least 16 types of neuraminidase and 9 of hema 
gglutinin, in principle, capable of forming more 
than 100 strains of flu virus. In reality, however, 
the flu viruses that commonly infect humans 
have been limited to only three types of hemag- 
glutinin and two types of neuraminidase’, 
giving six possible flu strains. Each season, 
there’s a chance that, through mutations and 
recombinations, a novel virus will emerge — 
one that few, or no, humans have encountered 
before. Rolling this viral game of dice leads to 
deadly consequences. 1957 saw the emergence 
of the H2N2 strain. This represented a major 
structural shift for the virus; for the previous 
40 years, all the flu cases had been caused by 
HIN1 viruses. This new virus killed 70,000 
people in the United States alone’. 

A more recent example of a novel virus was 
the H5N1 swine flu that circulated in 2009. 
Unlike seasonal flus, which are most severe 
in the elderly and those with other health 
problems, this virus killed a large number of 
young people — about a quarter of the deaths 
were in people younger than 24 years (many 
of them without pre-existing illness), and 
more than 60% were in people aged 25-64 
years — again, many of them otherwise 
healthy. The reason for this reversal of for- 
tunes was that the virus was similar to ones 
that had circulated many years ago. Many 
elderly people, therefore, had some immunity 
to it, but their children and grandchildren had 
never been exposed to any influenza like this 
one and so lacked any resistance to it’. 

Another issue that sets flu apart from some 
other infectious diseases is that antibodies 
against influenza don't have a very long shelf 
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Up close and personal: H5N11 (green) in cell culture. 


life. In certain groups of people, there are addi- 
tional concerns: after peaking, antibody levels 
decline faster in older people than in younger 
people. In people with compromised immune 
systems, such as someone with HIV/AIDS or 
someone taking certain cancer drugs, it's double 
trouble — antibody levels dontt rise as high after 
exposure to the virus in the first place, nor do 
they last as long. “It’s difficult to say whether any 
individual is still protected after a year or not, 
which is why we recommend getting the flu shot 
each year,’ says Lisa Grohskopf an infectious 
disease specialist who works in the influenza 
division of the Centers for Disease Control and 
Prevention (CDC) in Atlanta, Georgia. She adds 
that protection can wane from the beginning of 
one flu season to its end, since the season can 
stretch for seven or eight months. 

Infants or young children who haven't come 
into contact with the flu virus before — either by 
getting infected or from vaccination — don't have 
antibodies specific to the viruses circulating in a 
given year, putting them at risk for getting the flu. 

For reasons that aren't always clear, accord- 
ing to microbiologist Peter Palese at the Mount 
Sinai School of Medicine in New York, people 
with certain chronic illnesses also have a harder 
time resisting flu or fighting it once they do get 
it. Flu is more dangerous for people with heart 
disease, diabetes, asthma, chronic obstructive 
pulmonary disease (COPD) and morbid obe- 
sity. Palese explains that some of those diseases, 
such as asthma and COPD, cause persistent 
lung problems, so any illness that causes fluid 
to accumulate in the lungs, as flu does, would 
be more problematic for these people. For the 
obese, he says, breathing can be difficult, so the 
act of clearing mucus from the lungs might also 
be difficult, and the virus would linger in the 
lungs longer. Grohskopf adds that various health 


problems can create a vicious cycle. For instance, 
she says, “diabetes affects the immune system 
and so may affect the body’s ability to fight influ- 
enza; in turn, being ill with the flu can cause dif- 
ficulty with keeping blood sugar under control,” 
an outcome that can complicate diabetes. 

Flu’s effects can be amplified not only by ill- 
ness but by our genes. Indeed, just as the genetics 
of the flu virus itself (which determines which H 

and N type it is) can 
make it stronger or 


“The exuberance __ weaker, the genetics 
of the of the person who 
inflammatory encounters the influ- 
responseis likely —_enza virus can make 
at the root of that person more or 
why we have less likely to become 
symptoms of ill. One thing that 


has become clear in 
recent years is that 
single nucleotide 
polymorphisms in one of a number of genes 
governing immunity affects a person’s suscep- 
tibility to infectious diseases, and how sick he 
or she becomes from those infections. A 2008 
study by Ute Vollmer-Conna, a geneticist at the 
University of New South Wales in Australia, 
who studies the genetic response to infection, 
advanced that idea. Vollmer-Conna’s research 
looked at 300 people infected with one of 
three viruses: Epstein-Barr, Coxiella burnetii 
(which causes Q fever) and the Ross River 
virus’. Her conclusion: those who had the most 
severe symptoms, and those who were ill for 
the longest, had polymorphisms that signifi- 
cantly influenced pro- and anti-inflammatory 
cytokine production. Vollmer-Conna explains 
that these genetic predispositions intensify the 
inflammatory response to an acute infection, 
as well as prolonging recovery time. Although 


infection.” 
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the scope ofher study did not include influenza, 
she hypothesizes that “the same principles apply 
across a much broader range of common infec- 
tions, including flu’: 

Another, more recent study looked at the 
changes in gene expression caused by the flu 
virus specifically. Researchers exposed 17 
healthy subjects to flu virus type H3N2. Nine 
came down with flu symptoms and eight did 
not. When the scientists looked at blood from 
each subject, they noticed that different genes 
were up-regulated, depending on whether the 
person fell ill or not. “Individuals who became 
symptomatic had up-regulation of genes 
involved in the inflammatory response,” says 
Geoffrey Ginsburg, director of the center for 
genomic medicine at the Duke Institute for 
Genome Sciences and Policy in Durham, North 
Carolina, and one of the authors of the study. 
“The exuberance of the inflammatory response 
is likely at the root of why we have symptoms 
of infection,’ he says. In those who did not 
become ill, he explains, the genes that were 
up-regulated were those involved in protein 
synthesis and oxidative stress. “Asymptomatic 
subjects may have mechanisms at work that 
prevent active infection and therefore reduce 
the inflammatory response.” 

Aside from more research into the human 
genes that control the immune response to flu, 
there are plenty of other lines of investigation 
that would help explain why different people 
respond to flu so differently. Vollmer-Conna 
argues for more research to tease out which 
genes and biological processes are responsible 
for recovery after an infection. This knowledge, 
she writes, could let doctors identify those at risk 
for severe illness following infection with com- 
mon diseases, and therefore set them up with 
special prevention and treatment programmes. 

In a 2004 research paper, Palese posed sev- 
eral questions, including what makes it possible 
for some influenza viruses to hop from one ani- 
mal species to another; questions that he hoped 
researchers would have answered by now. He 
concedes that seven years later, the questions 
on his list are still pertinent. One in particular 
that still remains an important area of research, 
says Palese, are the viral genes that determine 
transmission versus the host or environmental 
factors that promote its spread. 

Clearly researchers have made some pro- 
gress, but as the poet Robert Frost once wrote, 
there are miles to go before we sleep. Frost 
was writing about stopping in the woods ona 
snowy evening in winter — which, of course, 
is flu season. = 


Christine Junge is a science writer in North 
Easton, Massachusetts. 
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midst the series of tragic events that was the first half 
of the twentieth century, the 1918 influenza pandemic 
stands out for its swift lethality. Over the course 
of about 18 months at the close of the First World War, an 
estimated 50 million people died from this viral infection. 

Since the last flu pandemic in 2009, public health officials have 
started to stand guard against another catastrophe (page S11). 
Yet echoes of this outbreak have reverberated through the 
decades and the disease, in its various guises, kills at least a 
quarter of a million people each year worldwide. The virus is 
adept at adaptation, leaving scientists chasing a moving target. 
The virus’s surface proteins mutate rapidly and can combine 
into dozens of variants (page S2); the antibodies we produce to 
fight one year’s flu strain can’t stop the others. 

Influenza evades the effects of drugs developed against 
it. First-generation antivirals are now almost useless, and 
newer classes of drugs — the neuraminidase inhibitors 
oseltamivir (Tamiflu) and zanamivir (Relenza) — are beset 
by resistance(page S9). And, owing to a number of factors, 
including genetics, individual responses to the virus vary 
widely; researchers are starting to get a handle on why some 
people infected with influenza become very sick or die while 
others are unscathed (page S14). 

Fears of widespread contagion have grown because the 
virus seems to be making a habit of jumping from its original 
bird hosts into domesticated mammals and birds (page S4). 
Perhaps the great hope lies in flu prevention. Researchers 
are reporting progress in developing a vaccine that could be 
effective against a broad range of flu strains (page S6), a feat 
that might mark a turning point against this killer. 
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EPIDEMIOLOGY 


Racing against the flu 


Influenza mutates fast and spreads easily, earning a place among humanity’s biggest killers. 


BY DUNCAN GRAHAM-ROWE 


nfluenza kills more than 250,000 people 
each year, worldwide. And with new strains 
ever emerging, there is always the threat of 
a repeat of the Spanish flu pandemic of 1918, 
which claimed more than 50 million lives. 
Indeed, despite vaccines and antiviral drugs 
that offer some protection to the vulnerable, 
nearly a century after that catastrophe we still 


well on surfaces and people’s hands,’ says flu 
specialist Anthony Mounts with the Global 
Influenza Programme at the World Health 
Organization in Geneva, Switzerland. 
Moreover, the flu virus changes all the time, 
mutating at the genetic level, which alters the 
surface level — in the hemagglutinin and neu- 
raminidase antigens that coat it (and provide 
the familiar H-N designations). These proteins 
are what our antibodies attach to; by continu- 


rapidly mutate that the virus is able to develop 
resistance to previously effective drugs, such 
as oseltamivir (marketed as Tamiflu by Swit- 
zerland-based Roche) and amantadine. 

On average between 5-15% of the global 
population are infected each year. Most at risk 
are children younger than 2 years and adults 
65 years or older, as well as people of any age 
with asthma, diabetes or chronic heart dis- 
ease. But no one is entirely free from risk. 


seem no closer to being able to predict and 
ultimately stave off such a virulent outbreak. 
One reason this has proved so difficult is the 
speed at which this acute viral infection can 
spread, transmitted by coughs and sneezes. 
“Tt’s a very stable respiratory virus that survives 


ally changing, these anti- 
gens enable the virus to 
infect people who have 
developed immunity 
to other strains. It is 
because of its ability to 


“This was particularly notable in this last 
pandemic,’ says Mounts, referring to the 
2009 outbreak, in which as many as half of 
those contracting the disease had fully func- 
tioning immune systems and were perfectly 
healthy beforehand. 


SD NATURE.COM 
Keep up to date with 
the latest news and 
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THE FAST OR THE FURIOUS 


To exact the greatest toll, influenza needs to be both fast-spreading and highly lethal. Unlike the 1918 ‘Spanish flu’, the two recent outbreaks — 
H5N1 ‘bird flu’ and H1N1 ‘swine flu’ — only had one attribute each. 
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The virulence of an outbreak is a combina- 
tion of the viral lethality and infectiousness. 
These, in turn, depend on which of the three 
varieties of human flu virus is the pathogen: 
type A, type B or type C. Type A is by far the 
most virulent; with 16 known subtypes, it 
mutates into countless different genetic and 
antigenic strains at a rate far faster than type 
B or type C. This ability to mutate quickly 
allows type A flu to spread faster. All the 
flu strains discussed in this Nature Outlook 
are influenza type A. 

Some of the most virulent strains encoun- 
tered are those that have managed to pass 
from other species to humans. The avian 
H5N1 flu virus had a very high lethality, kill- 
ing 56% of those who were infected. However 
it was also not terribly infectious: between 
2003 and 2006 there were only 263 cases of 
H5N1, mitigating its overall impact. In con- 
trast, the HIN1 swine flu virus of 2009-2010 
had a case fatality rate of only around 0.03% 
but, as it can be passed easily from human 
to human, it was responsible for more than 
13,000 cases in a single month. 


INFLUENZA MENAGERIE 

Flu has been found in a variety of mammals, 
from horses, cats, dogs, and pigs, to seals, fer- 
rets, camels and even whales. But it is mainly 
through wild aquatic birds that the virus is able 
to jump species — most likely because the virus 
originated in these avian species. Although it is 
next to impossible to predict where and when 
an inter-species jump is likely to take place, 
or which of these will result in a particularly 
virulent human strain, clues are beginning 
to emerge. 

By analysing the genetic and antigenic varia- 
tions of 13,000 strains of the H3N2 virus, a group 
led by infectious-disease experts Colin Russell 
and Derek Smith at the University of Cambridge 
in the UK were able to track the evolution of the 
virus as it moved around the world between 
2002 and 2007. They found that, at least for the 
common H3N2 virus, strains tend to originate 
in East and Southeast Asia, and mutate their 
way around the world until they end up in what 
Smith calls an “evolutionary graveyard” in South 
America — a continent that, in terms of flu at 
least, he says, is therefore “the safest place to be 
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because you get the biggest warning” 

The conditions in Asia are ripe for new influ- 
enza strains to emerge, says Russell. For one 
thing, the virus is able to exist in circulation 
almost perpetually, giving it more of a chance 
to mutate. Because tropical countries have no 
winter, seasonal influenza epidemics typically 
occur during the rainy season, which can hap- 
pen at different times of the year in neighbour- 
ing places like Bangkok and Kuala Lumpar, says 
Russell. This means that within East and South- 
east Asia there will be an influenza epidemic 
happening somewhere at any point in a year. 
“Given the travel and trade between cities and 
countries in the regions, influenza seems to be 
able to spread readily from place to place,” says 
Russell. “We often talk about it as being simi- 
lar to runners passing a baton ina relay race, as 
viruses move from epidemic to epidemic.” 

It’s a race that researchers, public health 
authorities, and the public continue — with 
mixed success — to try to disrupt. m 


Duncan Graham-Rowe is a science writer in 
Brighton, UK. 


HIN1, SNAPSHOT OF A RAPID PANDEMIC 
In May 2009, 48 countries reported 13,398 cases of influenza H1N1, the first influenza pandemic in four decades. 
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SOURCES: WHO/GIP AND FAUCI, A. S. MILBANK MEMORIAL LECTURE 1-22 (2006). 
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The flu catcher 


Richard Webby studies the ecology of influenza, trying to 
better understand how certain strains of influenza can leap 
across the species divide from animals to people. Nature 
Outlook sat down with him to learn more about his research. 
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What is the basic ecology of influenza? 

All influenza type A viruses likely originated 
in wild aquatic birds, and they've moved out 
to other hosts from there. The next layer is 
the intermediate hosts. Domestic animal 
species — mainly poultry and pigs — most 
often transmit the viruses to humans. Once 
the viruses have found their way from their 
natural wild hosts into domestic species, they 
behave differently. Something about these 
domestic reservoirs seems to increase the 
virus’s ability to infect people. 


What are the biggest unanswered questions 
when it comes to these interspecies jumps? 
What, all of a sudden, makes an animal virus a 
human virus? What specific virologic factors, 
what host factors, what environmental factors, 
align to allow that to happen? Were starting to 
gain greater under- 


: ; standing of the types 
‘If there - of ee oie 
anything that in animal populations, 
keeps many of US but we don’t have a 
up at nig ht,it’s firm grasp on which 
the H5 virus.” of them we should be 
most concerned about. 


The wildlife component is probably still one of 
the biggest missing pieces and it’s one of the 
hardest to study. At least with the domesticated 
species you don't have to chase them around too 
long to catch them. 

Still, we have learned a lot in the past decade. 
You pick up an old textbook and it'll say ifa virus 
is to jump from the aquatic bird reservoirs to 
humans, it probably has to go through an inter- 
mediate mammalian host, like a pig. The HS5N1 
strain of the influenza A virus— the bird flu that 
emerged in Asia in the late 1990s — taught us 
that viruses can also use domestic poultry as the 
intermediate host. I can’t think of many viruses 
that can pass directly between birds and humans 
besides influenza — certainly none that have the 
same capacity to cause human disease. 


Did bird flu teach us anything else? 

H5N1 resulted in alot more resources being put 
into understanding the animal reservoirs of influ- 
enza. Before bird flu emerged, what informa- 
tion we had typically came from fairly confined 
geographical regions. But in the last ten years 
we've increased that knowledge a lot, and now 
we understand more about the virus in different 
parts of the world. Of course, there are still many 
gaps to fill, but we're certainly much better off. 


Do wild birds experience flu like people do? 

Not really. Flu-infected wild birds don’t usually 
appear to be sick. It's a fairly stable host-micro- 
organism relationship, most likely evolved over 
millions of years, and the virus doesn’t cause 
many symptoms in the birds. When influenza 
viruses first enter a new host species, they might 
cause a lot of disease but mortality drops over 
time. The virus is now simply using wild birds as 
ameans to propagate and maintain itself. Highly 
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pathogenic forms of avian flu typically arise 
through evolution in domestic poultry species, 
and when they transmit back to wild birds, 
sometimes they've become more virulent or 
severe and do cause mortality in that bird spe- 
cies. But that’s the exception rather than the rule. 


How do wild birds transmit flu amongst 
themselves? 

That’s still unclear. One research angle is try- 
ing to understand hotspots of virus activity. 
The Delaware Bay region on the US east coast, 
which my colleagues and I have been studying, 
is a prime example. Shorebirds stopping in on 
their northward migration in May to feed on 
the eggs of spawning horseshoe crabs seem to 
have a very high prevalence of influenza. There's 
something special about that environment — 
probably that there are huge numbers of birds, 
many of them naive for influenza, coming 
together in one location at one time — and the 
virus spreads very rapidly. A few studies look- 
ing at those same bird populations further up or 
down their migratory path have shown much 
lower influenza burdens. Flu hotspots also 
appear in Australia, and we have some ongoing 
studies in Alberta, Canada. But they probably 
occur in many other parts of the world as well, 
where waterfowl gather to feed or breed. 


How many types of influenza viruses circulate 
in wildlife? 

The influenza virus has two major proteins on 
its surface: hemagglutinin (H) and neuramini- 
dase (N). Within the aquatic bird populations 
there are 16 serologically and genetically dis- 
tinguishable hemagglutinins and 9 different 
neuraminidases. This is where the flu subtypes 
get their names, like HSN1. Most — but not all 
— of the 144 possible combinations have been 
found circulating in wild birds. When we start 
to move away from that natural reservoir into 
domestic animals and humans, the diversity of 
viruses diminishes. 


Are other animals besides wild birds and 
domesticated poultry and pigs part of the 
influenza ecosystem? 
In terms of wildlife, it’s not just birds. We also 
have good serologic evidence that wild pigs 
can be infected by flu, and there are very likely 
other wild hosts that we know nothing about 
at all. When it comes to domesticated species, 
horses tend to harbour a horse-specific lineage 
of influenza virus, though that virus recently 
transferred to dogs. And the H5N1 bird flu 
virus and the 2009 H1N1 pandemic virus, 
which was widely called swine flu, have both 
jumped into dogs and cats. Exactly what role 
those animals play in terms of the ecology of 
flu is unclear, and I don't 


> NATURE.COM think we have a good 
Scientific American handle on what the true 
looks at where fluis prevalence of the virus is 
lurking: within their populations. 
go.nature.com/Igkgwr But certainly they can be 


hosts, they can get infected from other species, 
and they can probably transfer the virus within 
their own species as well. 


Why does the influenza virus seem to be 

so active in jumping between animals and 
people right now? 

That’s the question everyone is asking. Certainly 
the much larger demand on protein from the 
global population and rearing animals in larger 
numbers in smaller areas play a role in the 
evolution of some of these viruses. And these 
practices are also bringing domestic species into 
more contact with wild species. 


What questions are you trying to answer with 
your current research? 
We're focused on understanding how viruses 
behave over time within a given animal popu- 
lation, whether that be wild bird populations, 
poultry markets or swine herds. We are going 
back to some locations repeatedly to under- 
stand how the viruses themselves and their 
prevalences change over time. Then, if we do 
detect interspecies transmissions, we study 
these viruses in animal 


“The wildlife models to understand 
component the virologic basis 
is probably for the differences in 
still one of epidemiology that we 
the biggest see on the ground. 
missing pieces _ For instance, we're 
and it’s one of undertaking a fairly 
the hardest to large study with col- 
study.” leagues at the National 


Research Center in 
Cairo to study people regularly in contact with 
poultry to understand the real prevalence of 
the H5N1 bird flu virus jumping into humans. 
Typically, we only detect the virus in people 
with severe disease symptoms, since they're the 
ones that come into hospitals. But that’s very 
likely just the tip of the iceberg — and we have 
absolutely no idea how big the iceberg is. We've 
also been going back to some US swine farms 
to study circulating viruses — how they come 
into these populations, how long they stay, and 
how they leave. 

It’s very practical research. For example, here 
at this WHO meeting we're using detection data 
from over the past year to make sure that we 
have the right diagnostic capacities to detect 
the most current strains, and ultimately have 
the right vaccines to protect the public. 


Nearly 3 years on, what’s the lesson from the 
2009 H1N1 swine flu pandemic? 

The most important lingering question is 
finding out what was special about the H1N1 
pandemic virus that allowed it to become estab- 
lished in humans. There’sa lot of work studying 
the differences between it and the viruses that 
were circulating in pigs before the pandemic 
that never successfully jumped to humans. Had 
somebody identified the pandemic virus in pigs 
before the outbreak, nobody would have been 
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jumping up and down. It had none of the hall- 
marks that would have had us scuttling to make 
vaccines against it — no functional PB1-F2 pro- 
tein, which contributes to a flu virus’ lethality, 
for instance, and no novel gene segments. That's 
obviously not good. We hope this research will 
help us predict which viruses we need to be 
most concerned about. 


The H5N1 bird flu virus so far hasn’t shown 
much ability to pass from person to person. 
What’s the likelihood that the worst-case bird 
flu pandemic scenario will happen? 

That’s a tough one. The threat of HSN1 becom- 
ing a pandemic is likely lower than for any H1, 
H2 or H3 influenza virus because H5 isn't as 
contagious in humans. But if there is a virus we 
do not want in humans, it is the H5 because it’s 
so deadly. One of the hallmarks of highly patho- 
genic influenza viruses is that they accumulate 
additional amino acids in the hemagglutinin 
protein, which are thought to allow the virus 
to generate a systemic infection rather than just 
a respiratory tract infection. Only H5 and H7 
viruses are known to do this naturally. 

H5N1 has had 10 years to try, which tells 
us that it’s not one, two, or even three or four 
changes that are required for it to become a 
human pathogen — it’s probably more. On 
the other hand, there are examples of viruses 
taking several decades to jump species, despite 
prolonged contact. So we now have a little bit of 
confidence that H5N1 can’t become transmis- 
sible easily, but unfortunately I think it’s still a 
possibility. Ifthere’s anything that keeps many 
of us up at night, it’s the H5 virus. 


Are there other influenza strains that 
researchers have their eye on as well? 

There are a handful of viruses that likely pose 
the most threat. H5 being one because we know 
it infects humans and when it does, it causes 
severe disease. H7 poses a similar concern. Then 
there are the H2s, which have the capacity to be 
a successful human pathogen. There was an H2 
pandemic in 1957, and the virus disappeared in 
1968, so there’ a large percentage of the popula- 
tion that has no immunity to that virus now. 


What do you find most interesting about 
influenza? 

It still amazes me how little we know about these 
viruses. There’ not a whole lot to them—they’re 
fairly simple viruses — but we really have no 
good feel at all for what allows them to jump 
species. It’s an exciting field to be in. m 


Richard Webby, a virologist at St Jude 
Children’ Research Hospital in Memphis, 
Tennessee, directs the World Health 
Organization’ collaborating center for studies 
on the ecology of influenza in lower animals 


and birds. 


Interview by Rebecca Kessler, a freelance 
science journalist in Providence, Rhode Island. 
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Scientists developing a human vaccine against the avian influenza. 


PREVENTION 


Vaccine for all seasons 


As researchers map the stable parts of the proteins that stud the surface of influenza, the 
decades-long quest for a universal flu vaccine is showing signs of progress. 


BY JANA SCHLUTTER 


ile the eyes of flu researchers were 
focused on Asian bird markets in 
2009, a pandemic began to emerge 


at the opposite end of the world in Mexico and 
California: H1N1 influenza, a version of the 
1918 flu virus circulating in pigs for nearly a cen- 
tury, had suddenly leaped back into the human 
population, which now lacked herd immunity. 
The virus took the world by surprise. 

It took months to develop a vaccine matched 
to HIN1— too long to thwart the pandemic, 
which peaked in March 2009 and then again in 
early November, 2009. “We were fortunate that 
the virus was not really a nasty bastard and did 
not kill so many people,’ says virologist Robert 
Webster, whose laboratory at St Jude Children’s 
Research Hospital in Memphis, Tennessee, 
tracks flu viruses and guides the development 
of flu vaccines to stop them. “If we had an H5N1 


virus that was transmitting from human to 
human: God help us! We would have run out of 
antivirals almost overnight ? 

The 2009 outbreak reignited the hunt for a 
universal vaccine. In contrast to the seasonal 
flu jab or infection, the spreading pandemic 
virus, because it was so unlike its predecessors, 
elicited more cross-reactive, broadly neutral- 
izing antibodies than usual — and now sci- 
entists had the technology to find them. One 
patient even managed to make an antibody 
that could inactivate all 16 subtypes of the influ- 
enza type A virus — the so-called FI6 antibody. 
(Type A influenza is the most virulent kind, 
and is responsible for virtually all major human 

flu outbreaks.) 


> NATURE.COM Tt was, however, a rare 
For some of the find. Antonio Lanza- 
latest research on vecchia, an immunolo- 


influenza vaccines: 
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gist at the Institute for 
Research in Biomedicine 
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in Bellinzona, Switzerland, and his colleagues 
had to screen 104,000 white blood cells (B cells) 
collected from eight donors until they found one 
cell that produced the FI6 antibody. FI6 binds to 
hemagglutinins representative of all the 16 sub- 
types of type A influenza, including the H1N1 
swine flu and avian influenza H5N1, as well as 
to the more common H3N2 strains. In theory, 
this FI6 antibody provides a blueprint for design- 
ing a one-size-fits-all shot. “People have been 
searching for it forever,’ says Webster. “Now, it 
looks as if the Holy Grail for flu really might be 
achievable” 


STEADY TARGET 

The flu virus can skillfully play a chess-like game 
with the human immune system. Its surface is 
packed with varieties of two main proteins: 
neuraminidase and hemagglutinin. Hemagglu- 
tinin in particular plays two vital roles: its stem 
contains the machinery that allows the virus 
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TARGETING A SHAPE-SHIFTING VIRUS 


A universal flu vaccine needs to zero in on the parts of the 


virus that do not change as the virus evolves. 


Lipid envelope 


Protein shell 


M1 


M2 


Nucleoprotein (RNA) 


Neuraminidase 


to fuse with host cells; while flashy loops on its 
globular head act as decoys. To an antibody, 
each virus appears to be covered with some- 
thing resembling a dense tropical forest. The 
freely moving chains of amino acids through 
the treetops offer alluring binding sites. But 
these chains mutate and alter their shapes rap- 
idly. The antibody that the host’s immune system 
tailors to fit into the branches of one virus will be 
limited to that virus, and not work against any 
new influenza. 

Because existing flu vaccines mimic a natural 
infection, they fall into the same trap: even ifour 
body manages to make a few broadly neutral- 
izing antibodies they are vastly outnumbered 
by strain-specific ones that bind to the globu- 

lar head — and most 


“When we people have to get a 
understand the ratte | — 
basics behind gee cpeehearh 
cage To sidestep the trap, 
thisbiologywe 4. ; : 
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vaccines.” highly stable region 


of the hemagglutinin 
stem that tends to stay the same even after the 
virus mutates. FI6 and most other cross-reactive 
antibodies push their way in and go for these 
hard-to-find targets. “This is the first time that 
an antibody can neutralize every single influ- 
enza virus,” says Rino Rappuoli, head of vaccines 


Hemagglutinin 


research at Swiss pharmaceutical company 
Novartis. But while calling FI6 a “milestone”, 
Rappuoli is cautious about seeing it as the key to 
making a universal vaccine. “That’s tough,” he 
says. “We don't know how to do it yet, but it is a 
dream that can come true.” 


ROOT OF THE PROBLEM 

There are two ways to use broadly reactive 
antibodies: as a passive immunization to treat 
severely ill patients who did not respond to anti- 
virals, or as a template to develop a prophylactic 
vaccine for everyone. The latter is the more diffi- 
cult task — it requires sophisticated engineering 
of the antibody to derive a protein that not only 
fits the antibody’s binding site but also manages 
to coax the human immune system into produc- 
ing enough ammunition to prevent infection. 
Many researchers favour starting with the first 
option. “It's effective, it’s extremely broad, and it’s 
human — there should be no problem injecting 
antibodies into humans,’ says Lanzavecchia. “To 
be frank, our first goal is to use the antibody as 
a therapeutic” 

Gary Nabel, director of the Vaccine Research 
Center at the National Institute of Allergy and 
Infectious Diseases (NIAID) in Bethesda, 
Maryland, agrees on the difficulty in making a 
universal preventative flu vaccine. “We've been 
doing similar work with HIV, he says. Creating a 
prophylactic vaccine “sounds simple, but it takes 
time?’ The challenge, then, is in creating a vac- 
cine that will induce this specific yet unnatural 
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Receptor binding pocket (green) 


Two parts of the 
hemagglutinin 
protein remain 
unchanged as the 
virus mutates 
and are thus 
good targets for 
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conserved region 
on the stem and 
more recently 
discovered the 
receptor binding 
pocket on the 
head. 


Conserved stem region (yellow) 


response. Even if one presents a specific site ofa 
protein to the immune system, there are many 
ways in which antibodies can bind to it. “That's 
clearly an important problem to solve for any 
infectious disease, and flu can be our poster 
child,” says Nabel. “When we understand the 
basics behind this biology we will be in the driv- 
er’s seat to design a lot of vaccines that could be 
very, very promising” 

While eliciting ‘anti-stem’ antibodies in 
humans is difficult, it is not impossible. Nabel 
and his team were the first to apply a so-called 
prime-boost approach in humans to achieve this 
goal. This technique entails giving a combina- 
tion of two vaccines administered in sequence 
to induce the strongest immune response pos- 
sible against a broad array of influenza strains, 
including H5N1, H5N2 and H9N2. The prim- 
ing shot was a DNA vaccine: small, circular 
pieces of bacterial DNA genetically engineered 
to code for a specific protein that targets an 
avian flu hemagglutinin. Then, 24-weeks later, 
volunteers were given a booster vaccine made 
of whole inactivated H5N1 virus. This regimen 
enhanced the immune response against avian 
flu. In addition, three individuals were able to 
make broadly neutralizing antibodies directed 
at the hemagglutinin stem, a result that Nabel 
says is a “proof of concept” for a universal influ- 
enza vaccine. 

The mice in microbiologist Peter Palese’s lab 
at Mount Sinai School of Medicine in New York 
are already protected against various flu strains 


8 DECEMBER 2011 | VOL 480 | NATURE | S87 


| OUTLOOK | INFLUENZA 


by experimental, prophylactic flu vaccines. 
Palese has shown that two truncated versions 
of hemagglutinin, one with its globular head 
chopped off and the other only partly expressed, 
work better than full-length hemagglutinin, but 
theyre not perfect: they offer good protection 
against H3 strains of influenza virus, but they 
are weaker against H1. Moreover, Palese echoes 
the familiar caution from anyone working with 
animal models: “It’s promising,” he says, “but 
mice are not men’. 

Nonetheless, scientists and vaccine com- 
panies around the world are hard at work. 
Advances in high-throughput technology allow 
them to screen individual B cells from healthy 
volunteers or patients and search for rare anti- 
bodies. In 2008, a group based at the Dutch 
company Crucell was the first to find broadly 
neutralizing antibodies against influenza in 
humans. Apart from exploring therapeutic 
approaches, they are also trying to vaccinate 
rodents with conserved viral structures. 

Normally, the immune system overlooks the 
virus's stem (because it’s a target that’s hard to 
see and to get to) and produces instead many 
antibodies that target the viral protein’s head. 
Most broadly neutralizing antibodies, on the 
other hand, bind to conserved domains on 
the stem. Crucell is trying to recreate these 
conserved areas and expose them to the 
immune system to induce the production of 
broadly neutralizing antibodies. It’s crucial to 
use the right binding domain, because other- 
wise the antibody might be just as strain-specific 
as anything that binds to the head. “We know 
that we can rebuild the stem in a way that it 
looks like the binding domain,” explains Cru- 
cell immunologist Katarina Radosevic. But the 
question is how to rebuild these structures in a 
stable form so that they can induce an immune 
response in animals. “It's a complex task,’ Rado- 
sevic says. “You cart just chop off a part of a 
molecule and expect the rest to fold and func- 
tion the same way ” 


IN THE POCKET 

Recently, another target for a universal vaccine 
was discovered, one that is conserved but does 
not hide away in the protein folds. At the crown 
of hemagglutinin’s globular head is the recep- 
tor-binding pocket, which allows the virus to 
infect host cells. With minor modifications, this 
pocket has the same structure on all the 16 sub- 
types of hemagglutinin. But the pocket is tiny — 
much smaller than a typical antibody. Nobody 
expected it to be a good vaccine target, and yet 
this appears to be the case. 

To get snapshots of how the immune sys- 
tem reacts to the seasonal vaccine, research- 
ers from Duke University in Durham, North 
Carolina, screened antibodies from volunteers 
several times after vaccination. One of these anti- 
bodies, dubbed CH65, seemed to be a little 
more effective than others. The Duke team sent 
the information to Stephen Harrison, a struc- 
tural biologist at Harvard Medical School and 


Children’s Hospital in Boston, Massachusetts, 
for analysis. “We were surprised to find that it 
docks to the receptor-binding pocket,’ says Har- 
rison. “About half of the contacts are with amino 
acid residues in the pocket itself. The degree and 
intimacy of mimicry is striking” 

What's more, point mutations in the area sur- 
rounding the pocket did not hinder the antibody 
from docking. When the researchers tested the 
antibody against 36 strains of H1 that had cir- 
culated within the last three decades, it blocked 
30 of them. “Over 30 years of evolution of the 
virus, there were hardly any mutations that hada 
strong effect on binding,” Harrison says. Another 
possibility, he adds, is that since 1977, the anti- 
body arose so rarely that the virus did not have 
to escape its attack. 

Harrison's research interest is not so much the 
creation of a universal flu vaccine as in advanc- 
ing the basic science of affinity maturation: 
When exposed to the same antigens time and 
again, the immune system produces antibodies 
with slight mutations that will bind much more 
effectively to the invader. But why do some small 
changes make such a difference to the immune 

response — and how 
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had for many years 
been the only ones available. These include the 
ion channel M2, the matrix protein 1 (M1) and 
the nucleoprotein that the virus needs to keep 
its genome stable. “We have known about these 
conserved proteins for at least 20 years,’ says 
Rappuoli. While acknowledging that “theoreti- 
cally” one might prove to be a good target for a 
universal vaccine, he says that “there hasn't been 
much progress’ on these fronts. 

Many companies are still pursuing these tar- 
gets. VaxInnate, a biotech company in Cran- 
bury, New Jersey, has engineered a hybrid 
molecule that consists of four copies of M2e 
— part of the ion channel M2 that sits on the 
virus surface — fused into the bacterial pro- 
tein flagellin. VaxInnate recently reported that 
its vaccine, called Vax102, safely produces 
an immune response in humans that should 
be protective against all strains of influenza 
A. DynaVax, a biotech company in Berkeley, 
California, has fused M2e and nucleoprotein to 
create a vaccine candidate called N895; the goal 
is to encourage antibodies against M2e as well 
as T cells against the nucleoprotein. Acambis, 
the UK biotech company based in Cambridge, 
now owned by Sanofi Pasteur of Lyon, France, 
evaluated its own M2e vaccine in a phase I 
study in 2008 and found that it was immuno- 
genic against influenza A, and well-tolerated. 
The company also conducted another study 
that it says showed the vaccine protecting about 
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70% of ferrets against bird flu. 

Further development has stagnated. Sanofi 
Pasteur is hesitant about moving its M2e vac- 
cine to phase II trials. “M2e alone is unlikely to 
be better then the current seasonal vaccines,” 
says Jeffrey Almond, Sanofi’ vice president for 
discovery research and external R&D. “So we 
stepped backa little and thought about what we 
could add to M2e. There is no reason why a uni- 
versal vaccine shouldnt have multiple compo- 
nents.” Meanwhile, Merck, based in New Jersey, 
has halted its own trials. 


A DIFFERENT TACK 

Another path toward a universal vaccine 
involves not trying to elicit antibodies, but 
rather boosting the body’s T-cell response to 
infection. While antibodies prevent the virus 
from infecting host cells, T cells help to clear the 
virus from the body by killing flu-infected cells. 
That's the approach being pursued by vaccin- 
ologist Sarah Gilbert at the Jenner Institute — a 
vaccine research organization based at Oxford 
University in the UK. Gilbert uses an attenu- 
ated poxvirus called MVA, which presents 
the flu nucleoprotein and M1 to the immune 
system. As a result, Gilbert reports, “we sawa 
very large peak in T-cell response — everybody 
improved.” This T-cell-mediated immunity 
might provide another line of defense against 
flu, possibly in combination with a protein vac- 
cine that targets the stem of the hemagglutinin. 
Gilbert invited 11 vaccinated volunteers and 11 
non-vaccinated volunteers to a quarantine facil- 
ity. To test their vaccine, the researchers dripped 
H3N2 into the noses of their subjects and moni- 
tored their flu symptoms as well as their T-cell 
response. 

Combining Gilbert’s vaccine with the sea- 
sonal flu shot yields a preparation that boosts 
not only T cells but also the antibody response. 
Following vaccination with the seasonal flu 
shot alone, young people respond well by 
producing antibodies, whereas the same 
vaccine is less effective in older people. “Using 
both vaccines together might be extremely 
useful for vaccinating the elderly,’ Gilbert says. 
Usually, the seasonal flu jabs don’t work as 
well for older people, because their immune 
response diminishes quickly, and it becomes 
more and more difficult to make new antibodies 
as we age. “We boost what people already have,” 
Gilbert explains. “We are not trying to prime 
new responses.” 

Even a potent universal protein vaccine 
modelled after the perfect antibody would not 
change the fact that different age groups react 
differently to a flu jab. Thus a once-in-a-lifetime 
shot that protect against all strains might never 
be achieved for everyone: “The question is what 
universal means,” says Rappuoli. “Should it 
cover all the pandemic strains and the seasonal 
strains? That’s almost impossible. But it’s OK to 
have a dream to move forward? m= 


Jana Schliitter is a science writer in Berlin. 


TAKING ON A TOUGH VIRUS 


Flu drugs tend to stop working after the virus mutates enough to become resistant to them, 


and the arms race continues apace. 
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Antiviral treatments are a critical component of an 
effective healthcare response to influenza, but drug 
resistance to the treatment -of-choice has public health 


officials searching for other options. 


BY ROXANNE PALMER 


lhe influenza virus has adapted to 
Tevet the first generation of flu antivirals 
impotent, and there are signs of resistance 
to newer remedies. In late August 2011, the 
International Society for Infectious Diseases, 
based in Boston, Massachusetts, reported that 
25 influenza patients in eastern Australia were 
infected with a strain of the notorious swine 
flu — influenza A (H1N1) — resistant to the 
widely used drug oseltamivir (Tamiflu), made 
by Roche, headquartered in Basel, Switzerland. 
The Australian case is the largest cohort of 
oseltamivir-resistant flu yet reported as scientists 
and pharmaceutical companies explore ways to 
outsmart the virus. 
The good news is that resistance to antiviral 
drugs is not widespread in influenza strains for 
the upcoming 2011-2012 season — at least not 


yet. “Of the circulating strains of influenza in 
humans, we do not see resistance to oseltami- 
vir,’ says Charles Penn, an antiviral drug expert 
with the global influenza program of the World 
Health Organization (WHO) in Geneva, 
Switzerland. Although resistance to oseltami- 
vir was reported in as many as 1% of HIN1 
flu samples collected during the winter of 2010- 
2011, says Penn, that level doesn’t pose enough 
of arisk to warrant the WHO to change its treat- 
ment recommendations. 

That doesn’t mean that drug resistance is not 
a problem. Oseltamivir has successfully treated 
millions of patients since 1999, but mutations 
that confer resistance were described as early 
as 1998 — and given the ever-changing nature 
of flu, it can be hard to predict the trajectory of 
oseltamivir resistance. “In a span of two to three 
years we've seen a blossoming of resistance. 
At any time, the whole game can change,’ says 
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Zachary Taylor, an infectious disease fellow at 
the Kaiser Permanente Fontana Medical Center 
in Sacramento, California. In part to safeguard 
against the possibility of such game-changing 
developments, drug developers are slowly 
filling the pipeline with alternative therapies 
(see ‘Drugs to treat influenza infection’). Each 
drug come with side effects, which make them 
only worthwhile for those whom the flu could be 
potentially lethal — the elderly and the immu- 
nocompromised. 

Given the wily history of the influenza virus, 
any sudden appearance of drug resistance is 
certain to concern public health officials. The 
first antiviral drugs to combat the disease — 
the adamantanes, which target the M2 channel 
protein to block virus entry into host cells — are 
now essentially useless. The US Centers for Dis- 
ease Control and Prevention (CDC) found that 
100 % of seasonal H3N2 flu in the 2009-2010 
season and 99.8% of 2009 pandemic H1N1 flu 
were resistant to adamantanes. 

Oseltamivir belongs to a class of drugs called 
neuraminidase inhibitors. These agents block 
the active site ofa viral protein called neurami- 
nidase (N), thereby arresting the influenza virus 
ability to leave the host cell after it proliferates. 
The most common way for the influenza virus 
to evade oseltamivir is via the H275Y muta- 
tion (also known as H274Y) of neuraminidase, 
which replaces a single histidine amino acid 
with a tyrosine. This alteration interferes with 
the drug's ability to bind to the protein — a prob- 
lem acknowledged by the maker of oseltamivir. 
“There remains a medical need and room for 
additional treatment options, especially for 
the management of severe infections and for 
improved pandemic preparedness,’ says Klaus 
Klumpp, Roche’ top virologist. Klumpp says the 
Roche is supporting research into new therapies 
targeting viral replication as well as other mecha- 
nisms, but notes that these efforts are preclinical. 

Fortunately, viruses with the H275Y mutation 
are still susceptible to a different neuraminidase 
inhibitor: zanamivir (marketed by UK-based 
GlaxoSmithKline (GSK) as Relenza). Zanami- 
vir, the first neuraminidase inhibitor discovered, 
is generally administered by oral inhalation; its 
side effects include dizziness and nose irritation. 

The WHO recommends using zanamivir to 
treat patients afflicted with oseltamivir-resistant 
flu strains, and it’s already a viable alternative to 
oseltamivir. Zanamivir was actually the first 
neuraminidase inhibitor on the market, but its 
cousin oseltamivir was approved shortly after- 
wards and captured a larger market share. In 
January 2011, GSK began a double-blind study 
comparing intravenous zanamivir with oral 
oseltamivir. The study, which has enrolled 462 
adolescents and adults, is due to be completed 
in September 2013. 

While zanamivir 
remains an arrow in the 
quiver to be used in case 
of an oseltamivir-resistant 
strain, another drug 
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DRUGS TO TREAT INFLUENZA INFECTION 


Clinical name Type of drug 

Oseltamivir (oral) Neuraminidase 
inhibitor 

Zanamivir (inhaled) Neuraminidase 
inhibitor 


Neuraminidase 
inhibitor 


Peramivir (intravenous) 


DAS181 (inhaled) 


ADS-8902 (adamantine, 
ribavarin, oseltamivir) (oral) 


Fusion protein 


Triple combination 


Amantadine (oral) Adamantane 


Rimantidine (oral) Adamantane 


targeting viral neuraminidase is already avail- 
able in Asia and might be useful for patients who 
cant tolerate the other neuraminidase inhibitors. 
This third neuraminidase inhibitor — peramivir, 
developed by BioCryst Pharmaceuticals of Dur- 
ham, North Carolina — is available in Japan (as 
Rapiacta) and South Korea (as Peramiflu) but 
is still undergoing clinical trials in the United 
States. Its known side effects are similar to those 
of oseltamivir — diarrhea, nausea and vomit- 
ing. As an intravenous drug, peramivir can be 
administered to very sick or hospitalized patients 
who wouldnt be able to swallow oseltamivir or 
inhale zanamivir. In October 2009, the US Food 
and Drug Administration issued an emergency 
use authorization (EUA) for peramivir for peo- 
ple unable to take oseltamivir or zanamivir. 
The peramivir EUA expired in June 2010, and 
although the approval process has been fast- 
tracked in the United States, general approval 
of the drug is still several years away; the phase 
III clinical trial is not expected to be completed 
until May 2013. 


THE VIRTUE OF VARIETY 

Other pharmaceutical companies, cognizant of 
fears of resistance to neuraminidase inhibitors, 
are exploring a variety of ways to attack flu. One 
drug making its way along the pipeline is DAS 
181, an oral antiviral drug developed by NexBio 
in San Diego, California, under the trade name 
Fludase, which is now in phase II clinical trials. 
The drug isa fusion protein, which is created by 
combining genetic sequences that encode two 
or more other proteins into a single protein. It 
aims to prevent infection by inactivating viral 
receptors on cells in a patient's respiratory tract, 
making it harder for the virus to latch onto host 
cells. A study by NexBio published in 2009 
reports that Fludase was effective against 11 
oseltamivir-resistant strains of the H1N]1 virus 
— including a couple of variations that showed 
signs of reduced sensitivity to zanamivir. In Sep- 
tember 2011, NexBio presented results from the 
phase II trial at the Interscience Conference on 


Brand name and 
manufacturer 


Tamiflu (Roche) 


Relenza (GSK) 


Rapiacta, in Japan); 
Peramiflu in South Korea 
(BioCryst) 


Fludase (NexBio) 
N/A (Adamas) 


Symmetrel (Endo 
Pharmaceuticals) 


Flumadine (Forest 
Pharmaceuticals) 


Status 


Commercially available in 
most countries 


Commercially available in 
most countries 


Available in Japan and 
South Korea; in Phase III 
testing in U.S. 


Phase Il testing in U.S. 
Phase II testing in U.S. 


Commercially available; 
not recommended for 
influenza due to resistance 


Commercially available; 
not recommended for 


influenza due to resistance 


Antimicrobial Agents and Chemotherapy in 
Chicago, Illinois, that showed Fludase had cut 
the level of virus in patients’ blood after just the 
second day of treatment, much as zanamivir and 
oseltamivir do. However, viral load has not been 
found to correlate with clinical severity of a flu 
patient's illness, and its use as a surrogate end- 
point is controversial. 

Any single flu drug has drawbacks. A mix- 
ture of treatments might be a better strategy to 
combat the virus — and in some cases, overcome 
resistance. Adamas Pharmaceuticals, based in 
Emeryville, California, for example, has been 
developing a three-drug combination treat- 
ment strategy of oseltamivir, amantadine — an 
adamantane drug thought to be obsolete — and 
ribavirin (another antiviral drug commonly 
used to treat hepatitis C infection). In cell cul- 
tures, the three drugs worked better than any 
pair, and were even able to inhibit viral activity 
in strains of influenza resistant to oseltamivir. 

Nobody knows for sure how the three drugs 
work together. Adamas researcher Jack Nguyen 
speculates that each drug can take out the viruses 
that slipped past the others — a one-two-three 
punch combo. “After each stage you have 
fewer and fewer viruses making it through the 
cycle,’ Nguyen says. It may also be possible that 
oseltamivir's binding to neuraminidase causes 
other proteins on the virus’ surface to change 
shape and become more susceptible to the other 
drugs, Nguyen adds. 

In April 2010, Adamas released the results 
of a pilot study of its three-drug combination 
in seven patients with weakened immune sys- 
tems. Five of the six patients that received the 
triple treatment responded by day 10 of therapy. 
The lone patient that was given only oseltamivir 
did not respond after 20 days. “This pilot study 
was an important first step in validating that 
the combination of three antivirals can provide 
a virologic and clinical benefit to patients at 
risk for complications of influenza, says Janet 
Englund, a clinical investigator who led the trial 
at Seattle Children’s Hospital in Washington. The 
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US National Institute of Allergy and Infectious 
Diseases in Bethesda, Maryland is recruiting 
720 people for a phase II trial testing the efficacy 
of Adamas triple combination therapy versus 
oseltamivir in patients with severe medical 
conditions, such as heart or lung disease, which 
make them more susceptible to serious compli- 
cations from a bout of the flu. 

More options are being explored. Recently 
researchers at the Scripps Research Institute in 
La Jolla, California, infected mice with a fatal 
dose of influenza, then gave some of the animals 
compounds that inhibited their ability to pro- 
duce cytokines — cell signalling molecules that 
summon T cells and other immune system effec- 
tors to the site of infection. Mice that received 
these cytokine blockers plus the antiviral drug 
oseltamivir had a much higher survival rate 
(96%) than the other experimental groups that 
received only the cytokine-blocking compound 
(82% survival); oseltamivir alone (50%); or no 
treatment at all (21%). This suggests that much 
of the damage caused by flu is not due to the virus 
itself, but to an overenthusiastic response froma 
person's immune system. ‘Cytokine storms, trig- 
gered by an overproduction of immune signal- 
ing molecules, can cause significant damage to 
tissue. Indeed, the phenomenon is thought to 
be responsible for a large proportion of deaths 
during the 1918 influenza pandemic. 

Still, targeting the patient instead of the 
virus has its risks. “When you start tweak- 
ing the immune system, you have to wonder 
about whether you're under correcting or over- 
correcting the immune response,’ explains 
Dean Blumberg, head of paediatric infectious 
diseases at the University of California, Davis. 

Overcorrect, and you 


“In.aspan of two exacerbate the prob- 
to three years lem of cytokine storms; 
we've seen a under correct, and you 
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idea of using antiviral 
drugs at all, figuring that once contracted, the 
disease will simply run its course. In most cases, 
it will. In fact, neuraminidase inhibitors usually 
only shorten the duration of illness by about one 
day, and that's if the drug is taken within 48 hours 
of the first sign of flu symptoms. Yet Blumberg 
says that personally, “as a doctor and as a parent” 
he usually recommends a course of antivirals, to 
cut down on the length of illness. 

With no signs of an avian or swine flu pan- 
demic looming, the need for a wide spectrum of 
antiviral treatments might seem over cautious. 
But when warring against viruses, carefully laid 
preparations made in peacetime may serve well 
in the thick of the fight. m 


Roxanne Palmer is a science writer in 
Brooklyn, New York. 
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When the core hydrogen is exhausted during stellar evolution, the 
central region of a star contracts and the outer envelope expands and 
cools, giving rise to a red giant. Convection takes place over much of 
the star’s radius. Conservation of angular momentum requires that 
the cores of these stars rotate faster than their envelopes; indirect 
evidence supports this’”. Information about the angular-momentum 
distribution is inaccessible to direct observations, but it can be 
extracted from the effect of rotation on oscillation modes that probe 
the stellar interior. Here we report an increasing rotation rate from 
the surface of the star to the stellar core in the interiors of red giants, 
obtained using the rotational frequency splitting of recently detected 
‘mixed modes’”**. By comparison with theoretical stellar models, we 
conclude that the core must rotate at least ten times faster than the 
surface. This observational result confirms the theoretical prediction 
of a steep gradient in the rotation profile towards the deep stellar 
interior’”*. 

The asteroseismic approach to studying stellar interiors exploits 
information from oscillation modes of different radial order n and 
angular degree /, which propagate in cavities extending at different 
depths’, Stellar rotation lifts the degeneracy of non-radial modes, pro- 
ducing a multiplet of (2/ + 1) frequency peaks in the power spectrum for 
each mode. The frequency separation between two mode components 
of a multiplet is related to the angular velocity and to the properties of 
the mode in its propagation region. More information on the exploita- 
tion of rotational splitting of modes may be found in the Supplementary 
Information. An important new tool comes from mixed modes that 
were recently identified in red giants**. Stochastically excited solar-like 
oscillations in evolved G and K giant stars* have been well studied in 
terms of theory’, and the main results are consistent with recent 
observations from space-based photometry’*"*. Whereas pressure 
modes are completely trapped in the outer acoustic cavity, mixed modes 
also probe the central regions and carry additional information from the 
core region, which is probed by gravity modes. Mixed dipole modes 
(1 = 1) appear in the Fourier power spectrum as dense clusters of modes 
around those that are best trapped in the acoustic cavity. These clusters, 
the components of which contain varying amounts of influence from 
pressure and gravity modes, are referred to as ‘dipole forests’. 

We present the Fourier spectra of the brightness variations of stars 
KIC 8366239 (Fig. 1a), KIC 5356201 (Supplementary Fig. 3a) and KIC 
12008916 (Supplementary Fig. 5a), derived from observations with the 
Kepler spacecraft. The three spectra show split modes, the spherical 
degree of which we identify as / = 1. These detected multiplets cannot 
have been caused by finite mode lifetime effects from mode damping, 


because that would not lead to a consistent multiplet appearance over 
several orders such as that shown in Fig. 1. The spacings in period 
between the multiplet components (Supplementary Fig. 7) are too 
small to be attributable to consecutive unsplit mixed modes* and do 
not follow the characteristic frequency pattern of unsplit mixed 
modes’. Finally, the projected surface velocity, v sini, obtained from 
ground-based spectroscopy (Table 1), is consistent with the rotational 
velocity measured from the frequency splitting of the mixed mode that 
predominantly probes the outer layers. We are thus left with rotation 
as the only cause of the detected splittings. 

The observed rotational splitting is not constant for consecutive 
dipole modes, even within a given dipole forest (Fig. 1b and Sup- 
plementary Figs 3b and 5b). The lowest splitting is generally present 
for the mode at the centre of the dipole forest, which is the mode with 
the largest amplitude in the outer layers. Splitting increases for modes 
with a larger gravity component, towards the wings of the dipole mode 
forest. For KIC 8366239, we find that the average splitting of modes in 
the wings of the dipole forests is 1.5 times larger than the mean splitting 
of the centre modes of the dipole forests. 

We compared the observations (Fig. 1b) with theoretical predictions 
for a model representative of KIC 8366239, as defined in the Sup- 
plementary Information. The effect of rotation on the oscillation 
frequencies can be estimated in terms of a weight function, called a 
rotational kernel (K,,;). From the kernels, it is shown that at least 60% of 
the frequency splitting for the ]= 1 mixed modes with a dominant 
gravity component is produced in the central region of the star (Fig. 2). 
This substantial core contribution to mixed modes enables us to 
investigate the rotational properties of the core region, which was 
hitherto not possible for the Sun, owing to a lack of observed modes 
that probe the core region (within a radius r< 0.2 Ro; ref. 15). The 
solar rotational profile is known in great detail for only those regions 
probed by pressure-dominated modes’**. In contrast to these modes 
in the wings of the dipole forest, only 30% of the splitting of the centre 
mode originates from the central region of the star, whereas the outer 
third by mass of the star contributes 50% of the frequency splitting. By 
comparing the rotational velocity derived from the splitting of such 
pressure-dominated modes with the projected surface velocity from 
spectroscopy, we find that the asteroseismic value is systematically 
larger. This offset cannot be explained by inclination of the rotation 
axis towards the observer alone (Supplementary Tables 1 and 2), but 
originates from the contribution of the fast-rotating core (Fig. 2). 
Furthermore, internal non-rigid rotation leads to a larger splitting 
for modes in the wings of the dipole forest than for centre modes. 
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Figure 1 | Oscillation spectrum of KIC 8366239. a, Grey squares, radial 
modes; grey triangles, ! = 2 modes; black circles, ] = 1 rotational multiplets. A 
zoom on the region of 185-195 jtHz and the analysis of the comb-like structure 
of the oscillation spectrum in an échelle diagram are shown in the online 
material (Supplementary Figs 1 and 2). The spectral window of the Fourier 
analysis can be found in the online material (Supplementary Fig. 11). The y-axis 
indicates the flux variation power in parts-per-million squared for each 
frequency bin. b, The observed rotational splitting for individual / = 1 modes. 
Error bars, standard deviation of the measured rotational splitting of dipole 
modes. Similar analyses of the stars KIC 5356201 (Supplementary Figs 3 and 4) 
and KIC 12008916 (Supplementary Figs 5 and 6) are discussed in the online 
material. c, Theoretically predicted rotational splitting assuming two different 


Fora model rotating rigidly, the reverse behaviour is expected (Fig. 1c). 
Because the prediction for the rigidly rotating model is incompatible 
with the observed trends of the splittings (Fig. 1b) but can be well 
reproduced qualitatively under the assumption of non-rigid rotation 
(Fig. 1c), we conclude that the three stars investigated here (see Table 1, 
last column) rotate non-rigidly, with the central region rotating much 
faster than the surface. 


sey ge 7 


~ yy = 161.74 wHz (! = 1) 


v4, = 163.1 Hz (/ = 1) 


Y= 153.04 wHz (/ = 2) 7 


m/M 


Figure 2 | Contributions to the total rotational splitting. Partial integrals of 
normalized rotation kernels, illustrating the contribution from different regions 
to the rotational splitting for pressure-dominated modes (v4, v5; solid lines) and 
gravity-dominated modes (1, V2, v3; dashed lines) of degree / = 1 and! = 2, asa 
function of the stellar mass-fraction, m/M. The kernels have been calculated for 
modes from a representative model of KIC 8366239, as defined in the 
Supplementary Information, with oscillation frequencies given. Vertical dotted 
lines, left to right: boundary of the helium core, the hydrogen-burning shell and 
the bottom of the convective envelope. 
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rotation laws. The values are calculated for a representative model of KIC 
8366239 as defined in the Supplementary Information. Solid circles, splitting 
for non-rigid rotation for the case of a core rotation ten times faster than the 
surface rotation of 2.5kms_' resembles the observations qualitatively well. 
Open circles, theoretical splittings for rigid rotation and an equatorial surface 
rotation velocity of 3kms ' show a trend opposite to the observed one, with 
the largest splitting in the centre of the dipole forest and lower splitting in 
gravity-dominated modes. In the case of rigid rotation, the variable splitting is 
governed purely by the variation of the Ledoux” constant across the dipole 
forest (Supplementary Fig. 8). Because the representative model (c) has not 
been corrected for surface effects, there is a slight offset to the observations (b). 


The above interpretation is consistent with the correlation between 
the mode lifetime and the corresponding rotational splitting that has 
been observed in our data. Mixed modes in the wings of the dipole 
forest are predicted to have large amplitudes in the central regions of 
the star and, therefore, larger values of inertia and lifetime. These 
modes have narrower mode profiles in the frequency spectrum than 
have the centre modes that are predominantly trapped in the outer 
cavity''. This behaviour of the mode profiles (Supplementary Fig. 1) 
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Figure 3 | Rotational splitting versus mode linewidth for KIC 8366239. The 
lifetime is inversely proportional to the mode linewidth. Black circles, | = 1 
modes. Grey squares, linewidth of the pure acoustic radial modes (/ = 0), for 
comparison. Dotted vertical line, formal frequency resolution. Error bars, 
standard deviation of the measured rotational splitting and mode width of 
dipole modes. Similar diagrams for the two other stars from Table 1 are shown 
in Supplementary Figs 9 and 10, respectively. 
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Table 1 | Observational parameters of three stars, showing rotational splitting. 


KIC Vmax (uHz) R(R5) APobs (S) Test (K) vsini Asteros. rot. /=1 min rot. Averaged split. ratio 
(kms~+) vel. (kms~+) split. (uHz) for dipole modes 
8366239 1822+ 1 5.30 + 0.08 56211 4980 + 120 =) 39 0.135 + 0.008 15 
5356201 209.7 + 0.7 4.47 + 0.03 50+10 4840 + 90 24 3.8 0.154 + 0.003 LF 
12008916 159.9 + 0.6 5.18 + 0.05 5227 4830 + 100 > 5 0.200 + 0.001 18 


We present three cases of firm detections of rotational splitting in red giants from around 500 day-long time series of photometric data obtained with the Kepler satellite in long-cadence mode (about 30-min time 
sampling)*°. KIC, stars’ identifiers in the Kepler Input Catalog. vax, frequency of maximum oscillation power. The stellar radius R and the effective temperature T.;; were computed according to ref. 14. The mean 
period spacing AP», and its standard deviation were derived from the central multiplet components of the dipole modes. These three stars are classified as low-luminosity red-giant stars®*. The observationally 
derived mean period spacing of the mixed modes indicates that they are in an early phase of red-giant evolution’, in which they are burning hydrogen in a shell around the helium core. The projected surface 
velocity, v sini, was determined from ground-based spectra. The asteroseismic rotation velocity was computed from the minimum dipole splitting and assumed rigid rotation. Although this is the most pressure- 
dominated mode found in the spectrum, it contains a contribution from the core. The inferred rotation velocity from this mode is therefore slightly higher than the surface rotation velocity. The observed ratio of the 
average rotational splitting found in the wings of the dipole forest to the mean splitting of the forest centre is given in the last column. Extensive lists of further asteroseismic parameters are given in Supplementary 


Table 1 and 2. 


was recently confirmed by observations**. We observe that short-lived 
modes (with broader profiles) exhibit smaller rotational splitting in 
KIC 8366239 (Fig. 3) and the two other stars from Table 1 (Sup- 
plementary Figs 9 and 10). With increasing lifetime (dominant gravity 
component) we see a substantial increase in the size of the rotational 
splitting of the modes. The frequencies of the narrowest modes are 
mainly affected by the rotation in the central regions of the star. Taking 
a representative model for KIC 8366239, and assuming that the con- 
vective envelope rotates rigidly and that the radiative interior rotates 
rigidly but faster than the convective envelope, we find that the 
observed splitting ratio of 1.5 (Table 1) is obtained from a core rotating 
at least ten times faster than the surface. 

So far, our understanding of the evolution of the angular momentum 
as a function of the evolutionary stage is still poor. Before this study, it 
was possible to deliver evidence of non-rigid rotation in only two core- 
hydrogen-burning massive stars, for which the seismic analysis 
revealed a near-core rotation rate about three to five times faster than 
the envelope rotation rate’’”®. Claims of non-rigid rotation in white 
dwarfs have also been discussed*’. The low-luminosity red giants” 
discussed here are in the early phase of hydrogen-shell burning, and 
have a core-to-envelope rotation rate more than twice as high as those 
of the massive stars’””°. When these stars enter the subsequent phase of 
core-helium burning, we believe that the core will undergo a slight 
expansion and the envelope will shrink, leading to a surface rotation 
on the horizontal branch larger than that on the red-giant branch’. The 
basic assumption of conservation of angular momentum predicts a 
rotational gradient between surface and core that is less steep than 
the one detected. Exploiting rotational splitting of mixed modes in a 
large sample of red giants in various evolutionary stages will provide an 
excellent tool to inspect how the internal angular-momentum distri- 
bution evolves with time towards the end of the life of the star. 
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Open structure of the Ca**t 


gating ring in the 


high-conductance Ca** -activated K* channel 


Peng Yuan’, Manuel D. Leonetti’, Yichun Hsiung’ & Roderick MacKinnon! 


High-conductance voltage- and Ca”*-activated K* channels func- 
tion in many physiological processes that link cell membrane voltage 
and intracellular Ca** concentration, including neuronal electrical 
activity, skeletal and smooth muscle contraction, and hair cell tun- 
ing'*. Like other voltage-dependent K* channels, Ca”*-activated 
K* channels open when the cell membrane depolarizes, but in con- 
trast to other voltage-dependent K* channels, they also open when 
intracellular Ca** concentrations rise. Channel opening by Ca”* is 
made possible by a structure called the gating ring, which is located 
in the cytoplasm. Recent structural studies have defined the Ca’*- 
free, closed, conformation of the gating ring, but the Ca”*-bound, 
open, conformation is not yet known’. Here we present the Ca’*- 
bound conformation of the gating ring. This structure shows how 
one layer of the gating ring, in response to the binding of Ca”*, opens 
like the petals of a flower. The degree to which it opens explains how 
Ca’* binding can open the transmembrane pore. These findings 
present a molecular basis for Ca** activation of K* channels and 
suggest new possibilities for targeting the gating ring to treat con- 
ditions such as asthma and hypertension. 

Regulators of K* conductance (RCK) domains are ubiquitous 
among ion channels and transporters in prokaryotic cells'”*. Eight 
RCK domains assemble in the cytoplasm to form a closed-ring struc- 
ture that changes its diameter on ligand binding, thus enabling the 
gating ring to regulate allosterically the transmembrane component of 
the transport protein. In prokaryotic cells, the gating ring most often 
consists of eight identical RCK domains, arranged as a tetrad of pairs, 
giving rise to a four-fold symmetric ring with identical top and bottom 
layers of RCK domains’. Intracellular ligands such as Ca”* or small 
organic molecules bind in a cleft between RCK pairs forming the top 
and bottom layers to affect the shape of the ring'*'*""”. 

RCK domains are also found in multicellular eukaryotes in the Slo 
family of K* channels?'!*"°. There, two non-identical RCK domains 
are encoded in the carboxy terminus of the K* channel subunit. A 
tetrameric K* channel thus provides eight RCK domains to make a 
gating ring, but in contrast to most prokaryotic gating rings with 
identical RCK domains, the eukaryotic gating ring has one kind of 
RCK domain (RCK1) forming its top layer and another kind (RCK2) 
forming its bottom layer (Fig. 1a). The structure of a Ca?” -activated 
(Slol or BK) K*-channel gating ring in its Ca**-free, closed, con- 
formation was recently determined (Protein Data Bank accession 
3NABF), as was the structure of an RCK1-RCK2 pair in the presence 
of Ca** (PDB accession 3MT5), detailing the chemistry of the Ca**- 
binding site®'*. These studies showed that the BK gating ring indeed 
has distinct top and bottom layers, and also that a Ca”* -binding site 
known as the Ca*" bowl is in a different location than the ligand- 
binding site in prokaryotic gating rings. These unique structural prop- 
erties suggest that to regulate conduction, the eukaryotic gating ring 
may undergo conformational changes in a different manner than its 
prokaryotic counterpart. 

We determined the crystal structure of the Ca”* -bound gating ring 
from the zebrafish BK channel at a resolution of 3.6A using an 
expression construct in which a loop consisting of residues 839-872 


was deleted (Fig. 1b, Supplementary Table 1 and Supplementary Figs 1 
and 2). This loop was disordered in both the Ca*"-free human BK 
gating ring and the Ca” * -bound RCK1-RCK2 pair (PDB accessions as 
above), and its deletion in both the human and zebrafish BK channels 
has no detectable affect on function’’* (Supplementary Fig. 3). 
Therefore, the relative stability of conformational states of the BK 
gating ring seem to be unaffected by this loop. Initial phases were 
determined by molecular replacement using the RCKI-RCK2 Ca’*- 
bound structure as a search model’®. The solution showed the presence 
of eight RCK1-RCK2 pairs (two gating rings) per asymmetric unit of 
the crystal, which allowed eight-fold non-crystallographic symmetry 
averaging of electron density maps and eight-fold restraints throughout 
model refinement. The first step of refinement required rigid-body 
adjustment of the amino-terminal lobe of RCK1. The final model was 
refined to working and free residuals (Rwork and Reree) of 26.0% and 
28.9%, respectively. 

The zebrafish channel is 93% identical in amino-acid sequence to the 
human BK channel, for which the Ca”* -free, presumably closed, con- 
formation is known? (Fig. 1c and sequence alignment in Supplementary 
Information). The high degree of identity between zebrafish and human 
BK channels justifies comparison of the two structures to assess poten- 
tial conformational changes underlying Ca**-mediated gating. The 
Ca?*-bound and Ca’ -free gating rings are shown in Fig. 1b, c. In these 
representations, RCK1, which forms the layer of the gating ring closest 
to the membrane, is coloured blue and RCK2 is coloured red. In both 
gating rings, a green asterisk marks the N-terminal residue of RCK1, 
Lys 343. Ina full-length BK channel, the N-terminal residue of RCK1 is 
connected by a 16-amino-acid linker to the C terminus of the trans- 
membrane inner helix, which forms the pore’s gate”. 

The two gating ring structures, Ca~* -bound and Ca’ * -free, differ in 
two main respects. First, in the Ca**-bound structure electron density 
attributable to Ca”* is present in the Ca** bowl at the ‘assembly 
interface’, and, second, the RCK1 layer in the Ca” -bound gating ring 
is expanded from a diameter of 81 to 93 A, measured at the position of 
Lys 343 (Fig. 1d, e and Supplementary Fig. 2c, d). An objective com- 
parison of conformational differences between the Ca” -bound and 
Ca’*-free gating rings is achieved by superimposing all Ca atoms of 
the tetrameric gating rings. Such a superposition shows that the RCK2 
components of the ring undergo very little change, and that only 
modest conformational differences occur at the assembly interface, 
except locally near the Ca”* bowl (Fig. 2c and Supplementary Fig. 4). 
The Ca”* bowl structure is nearly identical to that of the Ca**-bound 
RCK1-RCK2 pair’* (Supplementary Fig. 2e). The major conforma- 
tional change involves the N-terminal lobe of RCK1, which changes 
its angle with respect to RCK2 (Fig. 2a, b and Supplementary Fig. 5). 
Viewing an interpolative movie in which the Ca’ '-free gating ring is 
morphed to the Ca**-bound gating ring, it seems that Ca** binding 
causes the N-terminal lobes of the RCK1 units to ‘open up’ on the 
membrane-facing surface of the gating ring in a way akin to petals 
opening on a flower (Supplementary Movie 1, part 1). 

We next analysed whether these conformational changes are com- 
patible with prior structural data on the opening and closing of KT 
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Figure 1 | The BK channel and the gating ring. a, Domain topology of the BK 
channel. For clarity, only two opposing subunits are shown. b, Crystal structure 
of the Ca?*-bound, open, gating ring, with RCK1 in blue and RCK2 in red. The 
Ca’* ions are shown as yellow spheres. The N termini of RCK1 that connect to 
the C termini of the inner helices are indicated as green asterisks. c, Structure of 


channels. We started with the open prokaryotic K* channel MthK 
and replaced its gating ring with the Ca”*-bound BK-channel gating 
ring’. This step involved alignment of the four-fold axis followed by 
minimization of the distance between the N terminus of the BK gating 
ring and the corresponding position within the MthK gating ring 
structure, through least-squares minimization of the distances 
between residues 343 of all four BK subunits and the corresponding 
residues, 116, of all four MthK subunits. The resulting model of an 
open BK-channel gating ring with an open K* pore is shown in Fig. 3a. 
The distance between the C-terminal residue of the inner helix of the 
pore and the N-terminal residue of RCK1, which was not constrained 
in making the model, is 33 A. In a second step, we replaced the open 
MthK pore with the closed KcsA pore by aligning the invariant selec- 
tivity filter, and replaced the Ca”* -bound BK-channel gating ring with 
the Ca”* -free BK-channel gating ring by aligning the nearly invariant 
RCK2 layer®”". The resulting model of a Ca”’ -free BK gating ring with 
a closed K™ pore is shown in Fig. 3b. In this closed model, the distance 
between the C-terminal residue of the inner helix and the N-terminal 
residue of RCK1, which again was not constrained in making the 
model, is 32 A. In this analysis, the similar pore-to-gating-ring distance 
(~32 A) in the open model and the closed model suggests that the 
magnitude of conformational change observed within the gating ring 
is compatible with the magnitude of change known to occur in the 
pores of other kinds of K* channel (Supplementary Movie 1, part 2). 
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the Ca?*-free, closed, gating ring, with RCK1 in blue and RCK2 in red (PDB 
accession 3NAF). d, The RCK1 N-terminal lobes (blue) and the Ca** bowls (red) 
from the Ca**-bound, open, gating ring. The diagonal distance between the Ca 
atoms of the N-terminal residues (Lys 343) is indicated. e, The corresponding 
region to that shown in d from the closed gating ring (PDB accession 3NAF). 


Whether or not the pore of the BK channel undergoes conformational 
changes similar to or different than those of other K* channels 
remains to be determined””®. 

Another feature of the model in Fig. 3 is noteworthy. In the pub- 
lished structure of the Ca’ * -free gating ring, part of the linker extend- 
ing centrally from the gating ring towards the pore was present in the 
crystal structure and visible on the surface of the gating ring’. The 
linker was held by a four-helix bundle fused to the N terminus to 
facilitate crystallization. Although the linker position in the structure 
might be affected by this attachment, we note that it is almost coincid- 
ent with the dashed lines in our closed model (Fig. 3b). The agreement 
between the observed linker position in the crystal structure of the 
closed gating ring and its inferred position in the closed conformation 
of our model is an interesting correlation because the radial direction of 
the linkers in the model (dashed lines) would naturally allow the 
expansion of the gating ring to exert an opening force on the pore’s 
inner helices. The idea that the gating ring exerts a direct force on the 
pore has been proposed on the basis of mutagenesis studies showing 
that the linker length is critical to efficient Ca” "-mediated gating!®””. 

The structural differences between the BK gating ring and the MthK 
gating ring underlie distinct ligand-induced conformational changes. 
A higher degree of molecular symmetry in the MthK gating ring stems 
from its containing eight identical RCK domains'*’’. Consequently, 
the MthK gating ring changes equally on both layers such that Ca”* 
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RCK2 RCK1 


RCK2 


Figure 2 | The flexible interface and the assembly interface. a, The flexible 
interface from the open gating ring, with RCK1 in blue and RCK2 in red. The 
Ca’* ions are shown as yellow spheres. The N- and C-terminal lobes of RCK1 
and RCK2 are labelled. Large disordered regions are indicated by dotted lines. 


binding causes a shrinking of the ring’s height and an expansion of 
its diameter'* (Supplementary Movie 2). By contrast, in the BK gating 
ring Ca" binding causes only the layer facing the membrane to 
undergo a substantial conformational change (Supplementary Movie 1). 


Flexible interface 
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Assembly interface 


RCK1 


b, The flexible interface from the closed gating ring, with RCK1 in blue and 
RCK2 in red (PDB accession 3NAF). c, Superposition of the assembly interfaces 
from the open and closed gating ring structures. Secondary structural elements 
oF, aD and «Q are labelled. 


This comparison presents a fascinating example in which the evolution 
of molecular structure has given rise to new or modified mechanical 
properties within a class of molecules. Precisely how the free energy of 
Ca’ * binding in either the BK or MthK gating rings is transduced into 


Figure 3 | A Ca”*-gating model for the BK channel. a, Views of the BK- 
channel model with an open gating ring and an open pore. The structure of the 
Ca’*-bound BK gating ring is docked onto the open MthK channel (PDB 
accession 1LNQ) by aligning the gating rings. Yellow spheres represent Ca”* 
ions. The N-terminal residues of RCK1 and the C-terminal residues of the inner 
helices are shown as black spheres. b, Views of the BK channel model with a 


Ca’*-free gating ring and a closed pore. Here, the Ca’*-bound gating ring in 
ais replaced by the Ca” -free gating ring (PDB accession 3NAF) and the open 
MthK pore ina is replaced by the closed KcsA pore (PDB accession 1K4C). The 
inner helix from KcsA is truncated to match the length in MthK. See 
Supplementary Movie 1. 
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mechanical work remains an important outstanding question that will 
require further experimental and theoretical work to understand at a 
deep level. 

The data and analysis presented here define end points in the Ca”* - 
induced gating conformational transition of the BK gating ring. Given 
the large conformational change that occurs on Ca”” binding, it 
should in principle now be possible to identify small molecules that 
stabilize one conformation or the other. Such small molecules promise 
to have important and possibly beneficial physiological effects because 
BK channels regulate smooth muscle tone in pulmonary airways and 
vascular beds*”~°. 


METHODS SUMMARY 


The cytoplasmic domain construct of the zebrafish BK channel including residues 
341-1,060 with a loop deletion (residues 839-872) was expressed and purified as 
described previously'*. However, the size exclusion chromatography was carried 
out in buffer containing 20mM HEPES (pH 8.0), 150mM NaCl and 20mM 
dithiothreitol. The protein was concentrated to about 6mgml ' and 10mM 
CaCl, was added before crystallization experiments were performed. Crystals were 
grown at 20 °C using hanging-drop vapour diffusion by mixing equal volumes of 
protein anda reservoir solution containing 50 mM 2-(N-morpholino)ethanesulfonic 
acid, 4% (w/v) PEG 4000 and 100 mM potassium sodium tartrate (pH 6.3). The 
crystals belong to space group P2,2,2, and the unit cell has dimensions of 
a= 137.65 A, b= 210.82 A and c = 238.76 A with « = f = y = 90°. Each asym- 
metric unit contains eight protein subunits forming two gating rings. The structure 
was determined by molecular replacement using the monomeric Ca**-bound 
human BK cytoplasmic domain (PDB accession 3MT5) as the search model. 
The final model was refined to a resolution of 3.6A with Ryo. = 0.260 and 
Rgee = 0.289. Electrophysiology recordings were conducted using patch-clamp 
on Xenopus oocytes expressing the wild-type and mutant channels. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Cloning, expression and purification. A synthetic gene encoding the zebrafish 
BK channel (GI: 189526846) was purchased from Bio Basic Inc. and served as the 
template for subcloning. The amino-acid sequence is 93% identical to that of the 
human BK channel. On the basis of previous structural and functional studies of 
the human BK channel’*, a cytoplasmic domain (CTD) construct including residues 
341-1,060 with a loop deletion (residues 839-872) was expressed and purified 
following a method previously described'*. However, the size exclusion chromato- 
graphy was carried out in 20mM HEPES (pH 8.0), 150 mM NaCl and 20mM 
dithiothreitol. The protein was concentrated to about 6mgml’ and 10mM 
CaCl, was added to achieve the ligand-bound, open, conformation before crystal- 
lization experiments. 

Crystallization and structure determination. Crystals of the zebrafish BK CTD 
were grown at 20 °C using hanging-drop vapour diffusion by mixing equal volumes of 
protein and a reservoir solution containing 50 mM 2-(N-morpholino)ethanesulfonic 
acid, 4% (w/v) PEG 4000 and 100 mM potassium sodium tartrate (pH 6.3). The 
crystals grew to a maximum size of about 0.2 mm X 0.2 mm X 0.3 mm within three 
days. Before being flash frozen in liquid nitrogen, the crystals were briefly transferred 
to a cryoprotectant solution containing 50 mM 2-(N-morpholino)ethanesulfonic 
acid, 6% (w/v) PEG 4000, 100mM potassium sodium tartrate, 150mM NaCl, 
10 mM CaCl, and 30% ethylene glycol (pH 6.3). The crystals belong to space group 
P2,2,2; and the unit cell has dimensions of a= 137.65A, b=210.82A and 
c= 238.76 A with « = 8 =) = 90°. Each asymmetric unit contains eight protein 
subunits. 

Diffraction data were measured at beamline X29 of the National Synchrotron 
Light Source and were processed with the HKL2000 program suite*'. The scaled 
data set was anisotropically corrected to resolution limits of 4.0, 3.6 and 3.6 A along 
the reciprocal cell directions a*, b*, and c*, respectively, using the diffraction 
anisotropy server at the University of California, Los Angeles’. An isotropic B 
factor of —29.98 A? was applied to restore the magnitude of the high-resolution 
reflections diminished by anisotropic scaling. The structure was determined by 
molecular replacement using the monomeric Ca” -bound human BK CTD (PDB 
accession 3MT5) as the search model"*. Initial phases were improved by eight-fold 
non-crystallographic symmetry averaging carried out with DM from the CCP4 
program suite’. Iterative model building was carried out in COOT™. Rounds of 
refinement were performed with CNS* and REFMAC™, and strong eight-fold 
non-crystallographic symmetry restraints were maintained throughout refine- 
ment. The final model was refined to a resolution of 3.6 A with Ryork = 0.260 
and Rgeo = 0.289. A few disordered regions were not modelled owing to weak 
electron density, and the final refined model includes residues 343-571, 581-614, 
688-810, 817-837, 875-949, 953-1,024 and 1,029-1,059. Residues for which side- 
chain density was poorly defined were modelled as alanines. The majority (95.9%) 
of the residues lie in the most favoured region in a Ramachandran plot, with the 
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remaining 4.1% in the additionally allowed regions. Eight Ca?" ions were 
modelled in the Ca”* bowls from the two gating rings in the crystal. Data collec- 
tion and structure refinement statistics are shown in Supplementary Table 1. All 
structural illustrations were prepared with PYMOL (http://www.pymol.org). The 
multiple-chain morphing script for CNS*’ was used to generate coordinates for the 
movies in Supplementary Information. 

Electrophysiology. Channel expression in Xenopus laevis oocytes and electro- 
physiological recordings in inside-out patches were carried out as previously 
described'*. Wild-type and deletion zebrafish BK constructs were cloned into 
the pGEM-HE vector**’. Plasmids were linearized with Pcil and capped cRNA 
was produced by in vitro transcription from the T7 promoter using the 
AmpliCap-Max T7 kit (Epicentre Biotechnologies). Xenopus laevis oocytes were 
injected with ~10ng of cRNA and recordings were made 48-60h later. For 
recording, inside-out patches were excised from freshly devitellinized oocytes 
using fire-polished glass pipettes with a typical resistance of 1-2 MQ. In Ca’*- 
free conditions, the bath solution contained 140 mM K-MeSO3, 20 mM HEPES, 
2mM KCl, 5mM EGTA (final pH, 7.0). For recordings in 10 tM free Ca’*, the 
same solution was supplemented with 4.83 mM CaCl, adjusting the pH to 7.0 (we 
assumed a Ky of 376nM for the EGTA-Ca”* complex at pH 7.0). The pipette 
solution was the same as the Ca’*-free bath solution, but supplemented with 
2mM MgCl. During recording, the membrane voltage was held at —60 mV 
and currents were elicited by pulsing the voltage from —120 to +140mV in 
10-mV increments (length of pulse, 20ms). For plots of conductance versus 
voltage, measurements were taken from tail currents at -60 mV, 700 ils after the 
voltage step. 
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A disinhibitory microcircuit for associative 
fear learning in the auditory cortex 
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Learning causes a change in how information is processed by neuronal circuits. Whereas synaptic plasticity, an 
important cellular mechanism, has been studied in great detail, we know much less about how learning is 
implemented at the level of neuronal circuits and, in particular, how interactions between distinct types of neurons 
within local networks contribute to the process of learning. Here we show that acquisition of associative fear memories 
depends on the recruitment of a disinhibitory microcircuit in the mouse auditory cortex. Fear-conditioning-associated 
disinhibition in auditory cortex is driven by foot-shock-mediated cholinergic activation of layer 1 interneurons, in turn 
generating inhibition of layer 2/3 parvalbumin-positive interneurons. Importantly, pharmacological or optogenetic 
block of pyramidal neuron disinhibition abolishes fear learning. Together, these data demonstrate that stimulus 
convergence in the auditory cortex is necessary for associative fear learning to complex tones, define the circuit 
elements mediating this convergence and suggest that layer-1-mediated disinhibition is an important mechanism 
underlying learning and information processing in neocortical circuits. 


The transformation of sensory input to an appropriate behavioural 
output is accomplished by neuronal circuits that process information 
through dynamic interactions between distinct types of neurons. 
Learning as an adaptive change of an animal’s behaviour modifies these 
network computations. Important cellular mechanisms underlying 
learning are thought to be activity-dependent synaptic plasticity’ and 
subsequent structural modifications’. In contrast to our detailed under- 
standing of these cellular mechanisms, the factors and circuit elements 
controlling neuronal activity and concomitant induction of plasticity 
during learning in the intact animal remain poorly understood. 

Neocortical interneurons exert powerful control over circuit activity 
by supplying inhibition to surrounding pyramidal neurons and to 
other interneurons’. The balance of excitation and inhibition is critical 
to circuit function, and is maintained under most conditions including 
sensory stimulation, when thalamocortical input typically recruits 
feed-forward inhibition a few milliseconds after direct excitation*”. 
This form of inhibition is mainly supplied by basket cells, the most 
abundant type of interneuron in the rodent neocortex’. Fast-spiking 
basket cells establish strong synapses with high release probability on 
the perisomatic region of pyramidal neurons where they control firing, 
and express the calcium-binding protein parvalbumin (PV)**’”. 

On top of the glutamatergic recruitment of inhibition by sensory 
input, several types of interneurons are also major targets of neuro- 
modulation*""’, suggesting that these systems can profoundly affect 
circuit computations by shifting the excitation—inhibition balance’. 
This has mainly been addressed in vitro, revealing a staggering com- 
plexity of effects which strongly depend on interneuron type*"'. Given 
that neuromodulation has a key role in learning'*", it is likely that 
interneurons are engaged in these processes. However, the contri- 
bution of different interneuron types to learning in the intact animal 
remains elusive. 

Auditory fear conditioning is a form of associative learning acquired 
by temporal coincidence of a neutral conditioned stimulus (CS, a 
tone) with a mild foot shock. Fear conditioning causes prominent, 


long-lasting plasticity of CS responses in auditory cortex'?°, which 
depends on the activity of cholinergic afferents from the basal 
forebrain’’’*"*. Pairing of tones with basal forebrain stimulation can 
elicit similar plasticity in auditory cortex'*”, and a recent study sug- 
gests an involvement of pyramidal neuron disinhibition in this pro- 
cess'’. Here, we investigate the contribution of identified types of 
auditory cortex interneurons to acquisition of auditory fear memory. 
Our results indicate that foot shocks, via recruitment of cholinergic 
basal forebrain afferents, activate a disinhibitory neocortical micro- 
circuit that is required for fear learning. 


Nicotinic activation of L1 interneurons by foot shocks 


Fear memory acquisition critically depends on the amygdala’®. In 
contrast, the role of auditory cortex is contentious, with evidence both 
for’? and against an essential function in auditory fear condition- 
ing’’’. Based on the hypothesis that stimulus complexity determines 
the engagement of neocortex’®, we performed differential fear condi- 
tioning using trains of upward and downward frequency-modulated 
sweeps as the CS (Fig. 1a, modulation between 5 and 15 kHz). To test 
directly the role of auditory cortex in this paradigm, we inhibited 
neuronal activity during fear conditioning by local injection of the 
GABA, (y-aminobutyric acid)-receptor agonist muscimol into the 
auditory cortex (Fig. la and Supplementary Fig. 1). When tested 
24h later in a drug-free state, this manipulation resulted in strongly 
reduced fear levels (Fig. 1b), indicating that activity in auditory cortex 
is required for fear learning in this paradigm. 

We next focused on the circuit mechanisms underlying acquisition 
of fear memory in the auditory cortex (defined here as areas Al and 
AuV). The role of the auditory cortex could be that ofan essential relay 
for tone information to downstream structures like the amygdala. 
Alternatively, foot shocks alone might also elicit activity in the auditory 
cortex, and convergence of tone and foot-shock information during 
fear conditioning could then contribute to fear learning. To distinguish 
these alternatives, we used two-photon calcium imaging of neurons in 
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Figure 1 | Foot-shock responses in auditory cortex L1 interneurons. a, Left, 
injection of the GABA,-receptor agonist muscimol (red) into the auditory 
cortex (blue). Right, differential fear-conditioning protocol using frequency- 
modulated sweeps. b, Drug-free freezing 1 day after conditioning in a new 
context. Compared to vehicle-injected mice (grey), muscimol reduced freezing 
to both CS” and CS* (red). ¢, Cytoarchitecture of upper layers of auditory 
cortex (interneurons grey, pyramidal neuron black). d, Two-photon calcium 
imaging in head-fixed, anaesthetized mice using OGB-1 AM (green) and 
sulforhodamine 101 (red, counterstains glial cells). e, Left, responses in L1 and 
12/3 to foot shocks in single neurons (grey) and the population average (red). 
Right, L1 interneurons display much stronger activation than L2/3 neurons. 
f, Two-photon targeted loose-seal cell-attached recording of L1 interneuron 
(green). g, Example traces (left) and z-scored population peri-stimulus time 
histograms (right) of two types of foot-shock responses in L1 interneurons. 
Inset, incidence of response type. Values are mean + s.e.m. *P < 0.05, 

***P <0),001. Statistical analysis in Supplementary Information. 


the superficial layers of the auditory cortex in mice anaesthetized with 
chlorprothixene and urethane (Fig. 1c, d). Trains of frequency- 
modulated sweeps used as the CS strongly activated neurons in layer 
2/3 (L2/3, Supplementary Fig. 2), whereas mild foot shocks applied to 
the hindpaws evoked little response (Fig. le and Supplementary Fig. 2). 
In marked contrast, the population of neurons located in L1 displayed 
strong activation by foot shocks (Fig. le). Layer 1 is unique in the 
neocortex because it contains only very few neuronal somata, almost 
all of which are GABAergic interneurons”. To investigate their 
activation in more detail, we used two-photon targeted loose-seal 
cell-attached recordings” (Fig. 1f). These experiments showed that 
LI interneurons are tonically active (baseline frequency 3.2 + 0.5 Hz, 
n= 30), and confirmed that the majority of Ll interneurons are 
strongly activated by foot shocks, while a minority (23%) displayed 
inhibition during and after foot shocks (Fig. 1g and Supplementary 
Fig. 3). The excitatory response was also present when foot shocks were 
paired with tones, whereas L1 interneurons did not show pronounced 
responses to tones alone (Supplementary Fig. 4). Interestingly, we 
observed a very similar activation of L1 interneurons by foot shocks 
in the primary visual cortex (Supplementary Fig. 5), indicating that the 
relevance of this pathway may extend beyond auditory memory 
acquisition. 

Layer 1 is a prominent feedback pathway in the neocortex, contain- 
ing both glutamatergic projections from higher cortical areas’’”* (and 
from non-specific thalamic nuclei*’) and cholinergic afferents from 
the basal forebrain, the major source of acetylcholine in the rodent 
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neocortex'**°**, We next sought to identify the afferent pathways 
mediating activation of L1 interneurons during foot shocks. Local 
bath-application of the glutamate receptor antagonist NBQX 
(1 mM) strongly reduced baseline firing frequency (Supplementary 
Fig. 6), but left the foot-shock response in L1 interneurons intact 
(Fig. 2a, b and Supplementary Fig. 6, see below). In contrast, com- 
bined application of the nicotinic acetylcholine receptor (nAChR) 
antagonists mecamylamine and methyllycaconitine (1 and 0.1 mM, 
respectively) abolished the L1 interneuron response almost completely 
(Fig. 2c and Supplementary Fig. 6), and drastically reduced baseline 
firing (Supplementary Fig. 6). In agreement with the interpretation 
that cholinergic basal forebrain afferents generate foot-shock res- 
ponses in LI interneurons, electrical microstimulation of the basal 
forebrain caused strong excitation of L1 interneurons in the absence 
of foot shocks (Fig. 2d and Supplementary Fig. 6). Activation of the 
basal forebrain by foot shocks is expected to have a longer latency than 
thalamocortical signalling’*. Latency analysis showed that L1 inter- 
neuron activation was biphasic, with an early, glutamatergic peak 
(onset 10 to 20 ms after shock onset) which may originate from non- 
lemniscal thalamus”, and a later, nicotinic peak (onset 40 to 50 ms after 
shock onset) that outlasted the foot shock and contained the majority 
of spikes (Fig. 2 insets). As previously reported”, recordings from brain 
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Figure 2 | Nicotinic activation of L1 interneurons by foot shocks. 

a, Population response of L1 interneurons to foot shocks. Insets in a-d are at 
high temporal resolution (10-ms bins). The control response was biphasic 
(onset 10-20 ms after foot-shock onset). b, Local application of the glutamate 
receptor antagonist NBQX left the response intact, but selectively eliminated 
the early peak, indicating its glutamatergic nature (inset, onset latency 40- 

50 ms). ¢, Local block of nAChRs by mecamylamine and methyllycaconitine 
(MEC&MLA) strongly reduced foot-shock responses, selectively eliminating 
the later, broader peak (inset). d, Electrical microstimulation of the basal 
forebrain activated L1 interneurons after 10 to 20 ms (inset, n = 4). e, Whole- 
cell recording of a L1 interneuron (black, soma and dendrites; red, axon) in 
auditory cortex slices during puff application of nicotine (100 1M, 20 ms). 

f, Left, example nicotine responses in control and after bath-application of 
mecamylamine (100 LM) and methyllycaconitine (0.1 |1M; grey, single trials; 
red, average). Right, all L1 interneurons displayed nicotine responses (n = 17), 
which were blocked by the nAChR antagonists (n = 8). Values are 

mean + s.e.m. **P < 0.01. Statistical analysis in Supplementary Information. 
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slices showed that all L1 interneurons showed responses to nicotine 
puffs (Fig. 2e, f) that were blocked by the same antagonists that abol- 
ished foot-shock responses in vivo (Fig. 2f), and could fire L1 inter- 
neurons (Supplementary Fig. 7). In summary, these results indicate 
that activity of cholinergic basal forebrain neurons is both necessary 
and sufficient to fire L1 interneurons during foot shocks, and that 
acetylcholine activates nAChRs on L1 interneurons. This implies that 
acetylcholine is released rapidly (<50 ms) after an aversive stimulus. 
Activation of L1 interneurons in turn is likely to have a central role in 
fear-conditioning-related plasticity in the cortex. 


L1 interneurons inhibit L2/3 PV* interneurons 


How do foot-shock responses in L1 interneurons affect processing in 
the local microcircuit? There is evidence that L1 interneurons can 
inhibit interneurons in L2/3 during nicotinic activation”. Fast-spiking, 
PV~ interneurons are the most abundant type of interneuron in the 
neocortex’. We injected a conditional adeno-associated virus (AAV) 
expressing a fluorescent marker (venus) into auditory cortex of PV- 
ires-Cre mice* to label these cells selectively (Fig. 3a). Targeted record- 
ings revealed that PV~ interneurons in L2/3 are tonically active under 
baseline conditions (5.9 + 1.2 Hz, n = 17), similar to fast-spiking inter- 
neurons in awake, head-fixed mice**. Foot shocks caused prominent, 
long-lasting inhibition of firing in the majority of PV* interneurons 
(88%, Fig. 3b and Supplementary Fig. 8), while the remaining two 
neurons displayed an excitatory response (Supplementary Fig. 9). 
Inhibition of PV* interneurons was strongly reduced by the nAChR 
antagonists mecamylamine and methyllycaconitine (Fig. 3b and 
Supplementary Fig. 8), which also blocked excitation of L1 interneurons 
by foot shocks (Fig. 2c). Because PV" fast-spiking interneurons lack 
intrinsic responses to acetylcholine’®”’, this result is consistent with 
direct inhibition of PV interneurons by L1 interneurons. In line with 
this interpretation, we observed morphological and functional synaptic 
contacts between L1 interneurons and PV~ interneurons (Supplemen- 
tary Fig. 10). In addition, the time course of foot-shock responses in the 
two populations matches (Supplementary Fig. 11). Taken together, 
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Figure 3 | Aversive shocks inhibit layer 2/3 PV* interneurons. a, Expression 
of venus in L2/3 PV* interneurons. b, Cell-attached recordings from pvt 
interneurons in control (top) show strong inhibition of spontaneous firing by 
foot shocks (left, example traces; right, population response). This response was 
strongly reduced in the presence of mecamylamine and methyllycaconitine 
(bottom). c, Single-unit recordings in the superficial layers of auditory cortex in 
freely behaving mice. d, Putative interneurons (red) were separated from 
putative pyramidal neurons (black) by unsupervised cluster analysis. Inset, 
example action potential (AP) waveforms (scale bar 400 1s). e, z-scored activity 
of putative interneurons showing inhibition by periorbital shocks during fear 
conditioning. Note recruitment by the CS* and subsequent inhibition by the 
shock (n = 9 of 24 putative interneurons, Supplementary Fig. 12). 
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these data indicate that L2/3 fast-spiking PV interneurons are inhibited 
by L1 interneurons during foot shocks. 

To test whether this mechanism is also engaged in awake, freely 
moving animals, we implanted mice with single-unit recording elec- 
trodes in the superficial layers of the auditory cortex (Fig. 3c and 
Supplementary Fig. 12). Putative interneurons were distinguished 
from putative pyramidal neurons using unsupervised cluster analysis 
(Fig. 3d and Supplementary Fig. 12). Recordings during fear condi- 
tioning confirmed that a large population of putative interneurons is 
inhibited during and after an aversive shock (Fig. 3e and Supplemen- 
tary Fig. 12). The same neurons were activated by the CS, indicating 
that the shock removes feed-forward inhibition in pyramidal neurons 
during auditory input. A similar proportion of putative interneurons 
displayed either excitation or no response to shocks (Supplementary 
Fig. 12). These data are consistent with the interpretation that excita- 
tion of Ll interneurons by aversive stimuli serves to remove both 
spontaneous and feed-forward inhibition provided by PV inter- 
neurons to surrounding pyramidal neurons in behaving mice. 


Disinhibition of L2/3 pyramidal neurons 


PV* basket cells provide strong, perisomatic inhibition to local 
pyramidal neurons**”"°. To test directly whether disinhibition is 
the main effect of foot shocks in auditory cortex L2/3 pyramidal cells, 
we used whole-cell recordings. Foot shocks elicited a depolarization 
from rest in all neurons tested (5 + 1.1 mV, n = 6, Fig. 4a). The ampli- 
tude increased at more depolarized membrane potentials (Sup- 
plementary Fig. 13), consistent with a response mediated by strong 
disinhibition and weak excitation**. Application of nAChR antagonists 
converted the foot-shock response to a net inhibition (Fig. 4a), suggest- 
ing an involvement of the disinhibitory circuit described here, and 
perhaps indicating that its block unmasks a component of inhibition 
recruited by foot shocks. Finally, recordings under conditions isolating 
inhibitory postsynaptic currents (IPSCs)’” revealed a drastic reduction 
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Figure 4 | Aversive shocks disinhibit L2/3 pyramidal neurons. a, Top left, in 
vivo whole-cell current-clamp recording of an auditory cortex pyramidal 
neuron during foot shocks. Bottom left, foot-shock response in the presence of 
mecamylamine and methyllycaconitine. Response area recorded at rest was 
reduced by nAChR antagonists (right). b, Whole-cell voltage-clamp recordings 
of inhibitory postsynaptic currents in pyramidal neurons show a strong 
reduction in IPSC frequency during and after foot shocks (top, example traces; 
bottom, population data, Vj,.1q0 to +20 mV). ¢, Left, example response of a L2/ 
3 neuron to frequency-modulated sweeps (top) and sweeps paired with foot 
shocks (bottom) recorded with two-photon calcium imaging. Right, foot 
shocks caused a threefold enhancement of sweep response area in L2/3. 

d, Single-unit recordings of putative pyramidal neurons with significant 
response to the CS* in freely-behaving mice during fear conditioning. Note 
strong activation by coincidence of sweep and shock. Values are mean + s.e.m. 
**P< 0.01, ***P < 0.001. Statistical analysis in Supplementary Information. 
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of IPSC frequency during and after the foot shock (Fig. 4b). These 
results indicate that inhibition of PV" interneurons is a dominant 
influence shaping foot-shock responses in pyramidal neurons, and 
are in line with the observation that basal forebrain stimulation causes 
disinhibition of pyramidal cells’’. 

Given that acetylcholine can affect many stages of neocortical 
information processing*', we next aimed to determine how sensory 
input interacts with foot-shock-mediated disinhibition. We imaged 
calcium responses to tones in L2/3 of auditory cortex, and compared 
them to presentations of the same tones in conjunction with foot 
shocks (Fig. 4c and Supplementary Fig. 14). This experiment revealed 
that foot shocks cause a strong enhancement of the calcium signal 
integral, suggesting that tone/shock compounds elicit much greater 
activity than tones alone. A similar observation was made with extra- 
cellular recordings in freely moving mice, where coincidence of tone 
and shock excited putative pyramidal neurons much more than tone 
alone (Fig. 4d). This effect was highly supralinear in both experiments 
because shocks alone elicited almost no activity in these neurons 
(Supplementary Fig. 14). In summary, we provide evidence that L2/ 
3 pyramidal neurons are disinhibited by aversive stimuli via inhibition 
of PV* interneurons. 


Fear learning requires auditory cortex disinhibition 

To determine whether activation of nAChRs contributes to fear learn- 
ing, we applied mecamylamine and methyllycaconitine locally into 
the auditory cortex before fear conditioning (Fig. 5a and Supplemen- 
tary Fig. 1). When tested 24h later in drug-free state, this manipula- 
tion resulted in strongly reduced fear levels (Fig. 5b), consistent with a 
requirement for nicotinic activation of L1 interneurons. However, L1 
interneurons are not the only circuit elements expressing 
nAChRs**"’. To test further whether disinhibition specifically during 
the foot shock contributes to fear learning, we used channelrhodopsin-2 
(ChR-2)*” expression in PV™ interneurons (Fig. 5c and Supplementary 
Figs 15 and 16). Mice with chronic, bilateral optic fibre implantation 
(Fig. 5d) were subjected to differential fear conditioning in which 
optogenetic stimulation of PV* interneurons occurred during and 
for 5s after the foot shock (Fig. 5e), the period during which we 
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Figure 5 | Auditory cortex disinhibition is required for fear learning. 

a, Injection of mecamylamine and methyllycaconitine (red) into auditory 
cortex (blue) before fear conditioning. b, Drug-free freezing in a novel context 
one day after conditioning. Compared to vehicle-injected mice (grey), nAChR 
antagonism reduced freezing to both CS” and CS* (red). c, Stimulation of 
ChR-2 expressing PV" interneurons (green) in auditory cortex (red) via an 
optic fibre (blue). d, Optogenetic manipulation in freely behaving mice. 

e, Differential fear conditioning protocol with optogenetic stimulation during 
and for 5 s after every foot shock. f, Freezing in a novel context without laser 
stimulation one day after conditioning. Compared to identically treated sham- 
injected littermates (black), virus-injected mice (blue) exhibit drastically 
reduced freezing to the CS*. Reconditioning without optogenetic stimulation 
yielded strongly enhanced freezing (red) that was indistinguishable from sham. 
Values are mean ~ s.e.m. **P< 0.01, ***P < 0.001. Statistical analysis in 
Supplementary Information. 
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observed inhibition of these neurons (Fig. 3b). When tested 24h later 
without optogenetic stimulation, these mice showed strongly reduced 
fear responses to the conditioned stimulus compared to sham-injected 
littermates (Fig. 5f). Reconditioning without optogenetic manipulation 
yielded normal fear learning (Fig. 5f). In addition, we ruled out the 
possibility that foot shock perception was perturbed by optogenetic 
stimulation and that laser illumination itself was perceived as a condi- 
tioned stimulus (Supplementary Fig. 17). Together, these data indicate 
that nicotinic disinhibition of the auditory cortex selectively during 
foot shock is required for associative fear learning. 


Discussion 


Using targeted recordings from identified populations of auditory 
cortex neurons in conjunction with single-unit recordings, phar- 
macological and optogenetic manipulations in behaving mice, we 
have identified a disinhibitory microcircuit required for associative 
learning. Our data show that L1 interneurons play a central role in 
conveying information about an aversive stimulus to auditory cortex. 
Activity of L1 interneurons was tightly controlled by endogenous 
acetylcholine released from basal forebrain cholinergic projections, 
which determined baseline firing and acutely activated the majority of 
L1 interneurons during foot shocks, while a sub-set responded with 
inhibition. Given that all Ll interneurons express functional 
nAChRs” (Fig. 2), a likely source of this inhibition are synaptic inter- 
actions within L1 (ref. 25). Layer 1 contains several morphologically 
distinct subtypes of interneurons” ****, and it remains to be deter- 
mined whether foot-shock response type correlates with morphology. 
A strikingly similar activation by foot shocks was also observed in the 
primary visual cortex, which could underlie aspects of contextual fear 
learning and indicates that cholinergic activation of L1 interneurons 
may be a general feature of neocortical organization. Together, these 
results add to mounting evidence that L1 is a prominent locus of 
feedback signalling in the neocortex’’*** and begin to define how 
feedback signals interact with thalamocortical feed-forward informa- 
tion during memory acquisition. Interestingly, L1 interneurons also 
receive prominent corticocortical feedback’, and are responsive to 
dopamine” and serotonin™, indicating that, depending on the nature 
of the learning task, different systems can feed into the microcircuit 
described here. 

Foot shocks reduced both spontaneous and feed-forward periso- 
matic inhibition provided by basket cells, thus disinhibiting pyramidal 
neurons. Importantly, we cannot rule out that electrotonically remote 
dendritic sites received unchanged or even increased inhibition, for 
instance from L1 neurogliaform cells****, which could serve to com- 
partmentalize induction of synaptic plasticity. Nicotinic enhancement 
of inhibitory input to pyramidal neurons has been observed in vitro". 
Therefore, the level of ongoing inhibition onto pyramidal neurons, as a 
prerequisite for disinhibition, may determine whether the net effect of 
nAChR activation is inhibitory or disinhibitory. Disinhibition is an 
attractive mechanism for learning because it is permissive for strong 
activation and concomitant plasticity induction, but not necessarily 
causative. Rather, the available sensory input at the time of the aversive 
stimulus can select the circuit elements for plasticity induction in a 
bottom-up fashion. In addition, other cholinergic actions like enhance- 
ment of thalamocortical transmission'’* and reduced GABA release 
from basket cell synapses’? may have acted in synergy with the micro- 
circuit described here to boost sensory responses. 

Disinhibition of pyramidal neurons by foot shocks in turn probably 
gated the induction of activity-dependent plasticity in the auditory 
cortex'®’” and at cortical afferents to the amygdala’*. In parallel, cho- 
linergic activation of L1 interneurons may also contribute to memory 
expression, because basal forebrain neurons acquire a conditioned 
response during learning*'**. Irrespective of the plasticity loci, our 
results delineate a role for the auditory cortex in fear conditioning 
to complex tones that goes beyond mere signalling of tone informa- 
tion to the amygdala'**'. Rather, we observe that stimulus conver- 
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gence and concomitant auditory cortex disinhibition are essential for 
fear learning. It is likely that the use of complex, naturalistic tones has 
emphasized the role of the auditory cortex'*, because fear condition- 
ing to pure tones is often unaffected by auditory cortex lesions’””' (but 
see refs 19 and 20). 

Beyond memory formation, cholinergic transmission is well 
known to mediate functions such as arousal and attention’®’’. 
Interestingly, the transition from quiet wakefulness to active whisking 
is associated with a sharp drop in firing of fast-spiking interneurons in 
the barrel cortex**, which is reminiscent of foot-shock responses in 
these neurons, and perhaps implies that cholinergic gating by inhibi- 
tion of PV" interneurons is a general hallmark of active brain states. 


METHODS SUMMARY 

Male C57BL6/J mice (1.5 to 6months old) were used. Full details of animals, 
materials and methods used for two-photon calcium imaging***’, patch-clamp”® 
and single-unit recordings, pharmacological and optogenetic intervention®”” and 
behaviour*® are provided in Methods. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 
Animals. Male C57BL6/J mice (1.5-6 months old, Harlan and Janvier) were 
housed under a 12h light/dark cycle, and provided with food and water ad 
libitum. Prior to fear conditioning, mice were individually housed for = 7 days 
and habituated to handling = 5 times. All animal procedures were executed in 
accordance with institutional guidelines and the prescribed authorities 
(Veterinary Department of the Canton of Basel-Stadt; Switzerland or Ministere 
de agriculture et de la forét, 87-848, France and European Economic 
Community, EEC, 86-6091). 
Head-fixed two-photon imaging and targeted patch-clamp recordings. Mice 
were sedated with chlorprothixene (5 mgkg ', intraperitoneal injection), kept 
warm on a hot-water bottle and after 10 min anaesthetized with urethane (0.9 g 
per kg, intraperitoneal injection). Core body temperature was maintained at 
37.5 °C by a feedback-controlled heating pad (FHC). Mice were fixed in a stereo- 
taxic frame (Kopf Instruments), and local analgesia was provided by injection of 
ropivacain under the scalp (10 mg ml‘, Naropin, AstraZeneca). After retraction 
of the scalp, the location of the area of interest was determined as follows. 
Auditory cortex: 2.46 mm posterior of bregma, 4.5 to 4.7 mm lateral of midline; 
primary visual cortex: 3.16 mm posterior of bregma, 2.5 mm lateral of midline. A 
custom-built head-plate was glued to the skull with cyanoacrylate glue (Ultra Gel, 
Henkel) and dental cement (Paladur, Heraeus), and a small craniotomy was 
performed (2-3-mm diameter). The dura was retracted, the exposed cortical 
surface was superfused with normal rat ringer (NRR) containing (in mM): 135 
NaCl, 5.4 KCl, 5 HEPES, 1.8 CaCl,, pH 7.2 with NaOH), covered with agarose 
(1% type-III-A in NRR) and fixed with a coverslip. For two-photon calcium 
imaging, the membrane permeant dye Oregon Green BAPTA-1 acetoxymethyl 
ester (OGB-1, 1 mM in NRR with 8% DMSO and 2% pluronic F-127, Invitrogen) 
was pressure-ejected from a patch pipette under two-photon visualization. After 
1h, this lead to intracellular staining of virtually all cells in a diameter of approxi- 
mately 300 jum“*. Localization of the labelled area in auditory cortex was verified 
post hoc in slices. Sulforhodamine 101 (0.2 mM in NRR) was bath-applied for 
5 min to the exposed cortex to counterstain glial cells**. Two-photon calcium 
imaging was performed using a scanning microscope (Fluoview 1000MPE, 
Olympus) coupled to a femtosecond-pulse infrared laser at 830 nm wavelength 
(MaiTai HP, Spectra-Physics). The beam was adjusted with a telescope to fill the 
back-focal plane of the X20 water-immersion objective (0.95 numerical aperture, 
Olympus). Average power in the back-focal plane was <120 mW. Frames were 
acquired (Fluoview software, Olympus) at 15 Hz at depths below the pia ranging 
from 20 to 400 um. Time stacks were processed offline using ImageJ to extract AF/ 
F for each neuron. Foot shocks (1 s, direct current, 0.4 to 2mA, 10-20 interval) 
were delivered to the hindpaws. Frequency-modulated sweeps (logarithmically 
modulated between 5 and 15 kHz, 50 ms rise and fall) were generated (System3, 
Tucker Davis Technologies) and presented at 70 dB sound pressure level from an 
electrostatic speaker (ES1, TDT) mounted in front of the animal. 
Two-photon-guided patch-clamp recordings were performed using patch pip- 
ettes of 4-10 MQ resistance filled with NRR for cell-attached recordings, for 
whole-cell current-clamp recordings with intracellular solution containing 
(inmM): 130 K-methanesulphonate, 6.3 KCl, 20 Na2-phosphocreatine, 4 Mg- 
ATP, 0.3 Na-GTP, 10 HEPES, 290 mOsm, pH 7.3 with KOH and for whole-cell 
voltage-clamp recordings with intracellular solution containing (in mM): 125 Cs- 
methanesulphonate, 2 CsCl, 10 Na-phosphocreatine, 4 Mg-ATP, 0.3 Na-GTP, 5 
TEACI, 3.5 QX-314, 0.5 EGTA, 10 HEPES, 290mOsm, pH7.3 with CsOH. 
Pipette solutions were supplemented with Alexa 488 or Alexa 594 (25-50 UM, 
Invitrogen). Neurons of interest were identified by either marker expression (L1 
interneurons in GAD67-GEP mice, PV~ interneurons in PV-ires-Cre mice™ 
injected with AAV conditionally expressing venus), or by imaging the ‘shadows’ 
created by neurons in dye-filled extracellular space in wild-type mice (L1 inter- 
neurons, L2/3 pyramidal neurons). L1 interneurons were located between 30 and 
70 kum below the pia. Cell-attached and whole-cell recordings were established 
using a X40 water-immersion objective (0.8 numerical aperture, Olympus) 
according to standard procedures**. Cell-attached recordings were performed 
in current-clamp or voltage-clamp configuration, and rejected when no action 
potential occurred during the entire length of the recording (=5 min). Whole-cell 
current-clamp recordings were rejected when the series resistance exceeded 
70 MQ. Whole-cell voltage-clamp recordings were performed at 0-20 mV hold- 
ing potential and rejected when the series resistance exceeded 50 MQ. IPSCs were 
detected using a derivative threshold of 40 pA ms_'. Pyramidal neurons were 
identified after dye filling by pyramidal morphology and spiny dendrites. 
Signals were recorded (Axopatch 200B, Molecular Devices or ELC-03XS, NPI), 
low-pass filtered at 20 kHz and digitized at 50 kHz (Digidata 1322A, Molecular 
Devices) using pClamp software (Molecular Devices). For cell-attached record- 
ings, action potentials were detected based on amplitude. Peri-stimulus time 
histograms were computed across the entire population of recordings of a given 


type. Microstimulation of the basal forebrain (20 pulses of 0.2 ms duration delivered 
at 100 Hz) was performed using bipolar stimulating electrodes at the following 
coordinates: 0.8 mm posterior of bregma, 1.75 mm lateral of midline, 4.3 mm below 
cortical surface. Drugs were present in the bath solution throughout the experiment. 
Unless stated otherwise, chemicals were obtained from Sigma. 

Surgery and local drug injection. Mice were anaesthetized with isoflurane 
(induction, 5%; maintenance, 1.5%) in oxygen-enriched air (Oxymat 3, 
Weinmann) and fixed in a stereotaxic frame (Kopf Instruments). Core body 
temperature was maintained at 36.5°C by a feedback-controlled heating pad 
(FHC). Analgesia was provided by local injection of ropivacain under the scalp 
(Naropin, AstraZeneca) and systemic injection of meloxicam (1001 of 
5mg ml}, intraperitoneal, Metacam, Boehringer-Ingelheim). Guide cannulae 
(26 gauge, with dummy screw caps, Plastics One) were implanted bilaterally to 
inject at the following coordinates: 2.46mm posterior of bregma, + 4.5mm 
lateral of midline, 0.6 mm below cortical surface. Implants were fixed to the skull 
with cyanoacrylate glue (Ultra Gel, Henkel) and dental cement (Paladur, 
Heraeus). Mice were then given 1 week to recover from surgery, during which 
time they were handled = 5 times to habituate them to the injection procedure. 
Fifteen minutes before fear conditioning, 32-gauge stainless steel injectors 
attached to 25 tl Hamilton syringes were inserted into the guide cannulae and 
an injection volume of 0.2 l per hemisphere was delivered within 40s using a 
microinfusion pump (Stoelting). Drug animals received bilateral injections of 
muscimol (100 ng per hemisphere) or a combination of mecamylamine (10 pg) 
and methyllycaconitine (10 1g) dissolved in NRR containing 1% fast green 
(Serva). Control mice were injected with vehicle solution only. In a subset of 
mice, fluorescent muscimol bodipy (625 1M in NRR with 5% DMSO) was 
injected after fear memory retrieval to quantify spread of the drug. After com- 
pletion of the experiment, mice were transcardially perfused with 4% para- 
formaldehyde in phosphate-buffered saline (PFA), their brains extracted and 
post-fixed in paraformaldehyde overnight. For histological verification of the 
injection site, 60-j1m coronal brain sections were made on a vibratome (Leica 
Microsystems) and imaged on a stereoscope (Leica Microsystems). 

Virus injection and optogenetics. For labelling of PV~ interneurons (Fig. 3a, b), 
an adeno-associated virus (AAV, serotype 2/7 (venus) or 2/9 (tdTomato), Vector 
Core, University of Pennsylvania) was used to deliver a DNA construct for 
conditional, Cre-dependent expression of venus or tdTomato. AAV (approxi- 
mately 0.5 tl per hemisphere) was injected from glass pipettes (tip diameter 10- 
20m) connected to a picospritzer (Parker Hannifin Corporation) into the 
auditory cortex of PV-ires-Cre mice*’ at the following coordinates: 2.46 mm 
posterior of bregma, 4.5 mm lateral of midline, 0 to 1.1 mm below cortical surface. 
Experiments were performed after 0.5-3 months of expression time. For optical 
control of PV* interneurons (Fig. 5c-f), the same methodology was used for 
bilateral injection of a conditional AAV coexpressing channelrhodopsin-2 
(ChR-2)*” and venus (AAV2/7_ EFla::DIO-ChR2(H134R)-2A-NpHR-2A- 
Venus”) into the auditory cortex of PV-ires-Cre mice. These animals were addi- 
tionally implanted with custom-built connectors holding optic fibres (0.48 
numerical aperture, 200-l1m diameter, Thorlabs). Fibre ends were inserted 
200 um into neocortex at the injection site. Implants were fixed to the skull with 
cyanoacrylate glue (Ultra Gel, Henkel) and dental cement (Paladur, Heraeus). 
After 2-4 months of expression time and habituation to handling, both implanted 
connectors were linked to a custom-built laser bench via optic fibres suspended 
over the conditioning context. This arrangement allowed the animals to move 
freely in the context. Stimulation of ChR-2 (<20 mW per implanted fibre) was 
delivered during and for 5s after each foot shock (Fig. 5e) using a custom-built 
laser bench (laser: MBL473, 473-nm wavelength, CNI Lasers). One day after 
conditioning, mice were exposed to CS* and CS” without optogenetic interven- 
tion in a neutral context. The optogenetic approach was previously validated in 
vitro’ and in vivo", and here by acute extracellular recordings from auditory 
cortex of infected animals (Supplementary Fig. 16). Custom-built optrodes (optic 
fibre with 16-wire electrode attached) were lowered into the infected area. Laser 
pulses (300 ms) were delivered and recordings performed as detailed in section 
‘Extracellular recordings in freely behaving mice’. After completion of the experi- 
ment, mice were transcardially perfused with 4% paraformaldehyde. Brains were 
post-fixed in paraformaldehyde overnight at 4°C, and cut into 80-um thick 
coronal slices on a vibratome (Leica Microsystems). To improve the fluorescent 
signal, an immunostaining was performed. Slices were kept in blocking solution 
(3% BSA, 0.2% Triton in 0.1 M PBS) for 1 h at room temperature, before applica- 
tion of the primary antibody (Goat anti-GFP, Abcam; 1:500 in blocking solution) 
and incubated at 4°C overnight. After washing, slices were incubated with 
secondary antibody (Alexa Fluor 488, donkey anti goat, Invitrogen; 1:1,000 in 
PBS with 3% BSA) at 4°C overnight. After a final wash, slices were mounted on 
coverslips and imaged. 


©2011 Macmillan Publishers Limited. All rights reserved 


Behaviour. Fear conditioning and fear retrieval took place in two different con- 
texts (context A and B). The conditioning and test boxes and the floor were 
cleaned before and after each session with 70% ethanol or 1% acetic acid, respect- 
ively. To score freezing behaviour, an automatic infrared beam detection system 
placed on the bottom of the experimental chambers (Coulbourn Instruments) 
was used. Mice were considered to be freezing if no movement was detected for 2 s 
and the measure was expressed as a percentage of time spent freezing. To ensure 
that our automatic system scores freezing rather than just immobility, we prev- 
iously compared the values obtained with those measured using a classical time- 
sampling procedure during which an experimenter blind to the experimental 
conditions determined the mice to be freezing or not freezing every 2s (defined 
as the complete absence of movement except for respiratory movements). The 
values obtained were 95% identical and the automatic detection system was 
therefore used throughout the experimental sessions. Conditioned stimuli for 
differential fear conditioning were 30-s trains of frequency-modulated sweeps 
(500 ms duration, logarithmically modulated between 5 and 15 kHz, 50 ms rise 
and fall) delivered at 1 Hz at a sound pressure level of 70 dB. The CS* (upsweep) 
was paired with a foot shock (1s, 0.6 mA, 15 CS*-foot-shock pairings; inter-trial 
interval: 20-180 s). The onset of the foot shock coincided with the onset of the last 
sweep in the CS*. The CS” (downsweep) was presented after each CS*-foot- 
shock association, but was never reinforced (15 CS” presentations, inter-trial 
interval: 20-180 s). On the next day, conditioned mice were submitted to fear 
retrieval in context B, during which they received 4 and 4 presentations of the CS” 
and the CS”, respectively. 

Extracellular recordings in freely behaving mice. Surgical procedures are 
described earlier. Mice were secured in a stereotaxic frame and implanted with a 
pair of insulated silver wires (170-,1m diameter) beneath the skin of each eyelid for 
delivery of periorbital shocks. In addition, mice were unilaterally implanted in the 
auditory cortex with a multi-wire electrode aimed at the following coordinates: 
2.46 mm posterior of bregma, 4.5 mm lateral to midline, and 0.6 mm to 0.85 mm 
below the cortical surface. Electrodes consisted of 16 individually insulated, gold- 
plated nichrome wires (13 tm inner diameter, impedance 30 to 100 kQ, Sandvik) 
contained in a 26-gauge stainless steel guide cannula. The wires were attached toa 
connector (18 pin, Omnetics). The implant was secured using cyanoacrylate 
adhesive gel. After surgery mice were allowed to recover for 7 days. Analgesia 
was applied before, and during the 3 days after surgery (Metacam, Boehringer- 
Ingelheim). Electrodes were connected to a headstage (Plexon) containing 16 
unity-gain operational amplifiers. The headstage was connected to a 16-channel 
computer-controlled preamplifier (gain 100X, bandpass filter from 150 Hz to 
9 kHz, Plexon). Neuronal activity was digitized at 40 kHz, bandpass filtered from 
250 Hz to 8kHz, and isolated by time-amplitude window discrimination and 
template matching using a multichannel acquisition processor system (Plexon). 
Single-unit spike sorting was performed using an off-line spike sorter (OFSS, 
Plexon). Principal component scores were calculated for unsorted waveforms 
and plotted on three-dimensional principal component spaces, and clusters con- 
taining similar valid waveforms were manually defined. A group of waveforms was 
considered to originate from a single neuron if it defined a discrete cluster in 
principal component space that was distinct from clusters for other units, and if 
it displayed a clear refractory period (1 ms) in auto-correlograms. Template wave- 
forms were then calculated for well-separated clusters and stored for further ana- 
lysis. To avoid analysis of the same neuron recorded on different channels, we 
computed cross-correlation histograms. If a target neuron displayed a peak of 
activity at a time that the reference neuron fired, only one of the two neurons 
was considered for further analysis. To separate putative inhibitory interneurons 
from putative excitatory pyramidal neurons we used an unsupervised cluster 
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algorithm based on the Ward’s method. Briefly, the Euclidian distance was calcu- 
lated between all cell pairs based on the two-dimensional space defined by each 
cell’s average spike width (measured from trough to peak) and baseline firing rate. 
An iterative agglomerative procedure was then used to combine cells into groups 
based on the matrix of distances such that the total number of groups was reduced 
to give the smallest possible increase in the within-group sum of square deviation. 
Mice were presented with the same conditioned stimuli used for fear conditioning 
(see above). The CS* was paired with a periorbital shock (2s, 2.5mA, 15 cst- 
shock pairings, inter-trial interval 20-180 s), which was used instead of foot shocks 
to minimize electrical artefacts in the recording. The onset of the periorbital shock 
coincided with the onset of the last sweep. The CS” was presented after each CS* 
(15 CS’ presentations, inter-trial interval: 20-180 s). CS* and CS~ were counter- 
balanced across animals. At the conclusion of the experiment, recording sites were 
marked with electrolytic lesions before perfusion, and electrode locations were 
reconstructed with standard histological techniques 

In vitro electrophysiology. Coronal sections (300-j1m thick) of auditory cortex 
were prepared from mice (6-8 weeks old) in ice-cold slicing artificial cerebrosp- 
inal fluid (ACSF) containing (in mM): 124 NaCl, 1.25 NaH2PO,, 10 MgSOg, 2.7 
KCl, 26 NaHCO3, 2 CaCls, 10 Glucose, 4 ascorbate (95% O3/5% CO3). Slices were 
incubated for 45 min at 37 °C in an interface chamber, and then allowed to cool to 
room temperature. Recordings were performed at 34 °C under infrared videomi- 
croscopy in ACSF containing (in mM): 124 NaCl, 1.25 NaH,POg, 1.3 MgSO, 2.7 
KCl, 26 NaHCO, 2 CaCl, 10 glucose, 4 ascorbate (95% O2/5% CO2). Whole-cell 
current-clamp recordings were performed with patch pipettes (4-10 MQ resist- 
ance) filled with intracellular solution containing (in mM): 130 methanesulpho- 
nate, 6.3 KCl, 20 Na2-phosphocreatine, 0.3 Na-GTP, 4 Mg-ATP, 10 HEPES, 5 
biocytin, 290 mOsm, pH 7.3 with KOH. Signals were recorded (Multiclamp 700B, 
Molecular Devices), low-pass filtered at 10 kHz and digitized at 20 kHz (Digidata 
1322A, Molecular Devices) using pClamp9 software (Molecular Devices). Signals 
were analysed using Igor Pro (Wavemetrics). The border between L1 and L2/3 
was visually identified by the abrupt change in cell number, and only L1 inter- 
neurons at least 15 jm away from the border region were chosen for recordings. 
Local puff application of nicotine (100 4M, 20-80 ms duration) was delivered 
from a patch pipette connected to a picospritzer (Parker Hannifin Corporation). 
For paired recordings between L1 interneurons and L2/3 PV* interneurons, PV- 
ires-Cre mice were injected with an AAV (Vector Core, University of 
Pennsylvania) leading to Cre-dependent expression of tdTomato. Fluorescent 
PV* interneurons were identified using TillvisION (Till Photonics) and double 
recordings were performed as described above. Unless stated otherwise, drugs 
were bath-applied. To reveal morphology, slices were fixed in 4% paraformalde- 
hyde (4 °C overnight), washed and stained for biocytin (4 days at 4 °C in PBS, 0.5- 
1% Triton-X, 0.2% Alexa (488, 568 or 680) streptavidin conjugate, Invitrogen). To 
identify putative synaptic contacts, this was combined with immunostaining 
against venus (see above) or tdTomato (Rabbit anti-RFP, MBL, Nunningen; 
1:500 in blocking solution; Alexa Fluor 568, donkey anti-rabbit, Invitrogen; 
1:1,000 in PBS with 3% BSA). Slices were mounted and confocal images acquired 
using an LSM700 confocal laser scanning microscope (Zeiss). Stacks were ana- 
lysed using Image] (NIH) and single representative cells were reconstructed using 
Neurolucida software (MBF Bioscience). Putative synaptic contacts were scored 
when the axon of a L1 interneuron was located within 1 um from a PV~ dendrite 
or soma. 


47. Tang, W. etal. Faithful expression of multiple proteins via 2A-peptide self- 
processing: a versatile and reliable method for manipulating brain circuits. 
J. Neurosci. 29, 8621-8629 (2009). 
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Metastatic growth in distant organs is the major cause of cancer 
mortality. The development of metastasis is a multistage process 
with several rate-limiting steps’. Although dissemination of tumour 
cells seems to be an early and frequent event’, the successful 
initiation of metastatic growth, a process termed ‘metastatic 
colonization’, is inefficient for many cancer types and is accomp- 
lished only by a minority of cancer cells that reach distant sites**. 
Prevalent target sites are characteristic of many tumour entities’, 
suggesting that inadequate support by distant tissues contributes to 
the inefficiency of the metastatic process. Here we show that a small 
population of cancer stem cells is critical for metastatic coloniza- 
tion, that is, the initial expansion of cancer cells at the secondary 
site, and that stromal niche signals are crucial to this expansion 
process. We find that periostin (POSTN), a component of the 
extracellular matrix, is expressed by fibroblasts in the normal tissue 
and in the stroma of the primary tumour. Infiltrating tumour cells 
need to induce stromal POSTN expression in the secondary target 
organ (in this case lung) to initiate colonization. POSTN is required 
to allow cancer stem cell maintenance, and blocking its function 
prevents metastasis. POSTN recruits Wnt ligands and thereby 
increases Wnt signalling in cancer stem cells. We suggest that the 
education of stromal cells by infiltrating tumour cells is an import- 
ant step in metastatic colonization and that preventing de novo 
niche formation may be a novel strategy for the treatment of 
metastatic disease. 

We aimed to explore limiting factors that determine metastatic 
success using the MMTV-PyMT mouse breast cancer model, which 
spontaneously metastasizes to the lungs®’. We reasoned that the 
recently identified cancer stem cells (CSCs, also called tumour-initiating 
cells), a subset of cancer cells that allow long-term tumour growth and 
are thought to be responsible for remissions”*, might also be relevant to 
the development of metastatic disease (Supplementary Fig. 1). We 
measured the relative size of the population of CSCs from primary 
MMTV-PyMT tumours and their pulmonary metastases using the 
previously established markers CD90 and CD24, which label a subset 
of the CD24* CD29" or CD24* CD49" population used earlier to 
isolate CSCs and normal mammary gland stem cells’? (Supplemen- 
tary Fig. 2). This CSC subset accounts for 3 + 2.1% (s.d.) of all tumour 
cells from both primary tumours and metastases (Fig. 1a). When 
CD90*CD24* CSCs or CD90* CD24" -depleted non-CSCs are sepa- 
rately isolated from GFP* tumours and directly introduced into mice 
through tail vein injection (GFP, green fluorescent protein), only the 
CSC population is able to produce lung metastases (Fig. 1b). Moreover, 
CD90* CD24" cells isolated subsequently from pulmonary metastases 
are again the only tumour cell population that efficiently initiates 
secondary metastases (Fig. 1c). This is not due to differences in the 
extravasation capabilities of CSCs and non-CSCs (Supplementary 
Fig. 3). 


In time course experiments, the relative size of the CSC population 
changes drastically during metastatic colonization. When injecting 
unfractionated tumour cells, the total number of tumour cells in the 
lung declines rapidly within the first 7d after seeding’* (Fig. 1d and 
Supplementary Fig. 4). Notably, the relative amount of CSCs 
transiently increases within the first and second weeks to more than 
20%. This is due to selective expansion of the stem cell population as a 
result of their proliferation rate increasing over sixfold relative to the 
primary tumour in this early phase of metastatic colonization (as 
measured by BrdU incorporation; Fig. le and Supplementary Fig. 5). 
By contrast, non-CSCs show reduced proliferation 7 d after injection 
and, notably, fail to generate CSCs (Fig. le and Supplementary Fig. 6). 
Consequently, only CSCs are able to form metastatic colonies, and 
non-CSCs remain as single cells (Fig. 1f). Together, this demonstrates 
that selective expansion of CSCs is responsible for the initiation of 
metastasis. 

The number of injected CSCs evidently exceeds the number of 
metastatic nodules, indicating that additional factors restrict successful 
metastatic colonization. Stem cells are suspected to rely on signals 
from their stromal environment, such as localized growth factors that 
can affect stem cell maintenance and proliferation’ ’”. We identified 
Postn as a stromal factor of normal stem cell niches and the metastatic 
niche (Supplementary Methods). Postn encodes for periostin, which 
becomes incorporated in the extracellular matrix’® and has a role in 
bone, tooth and heart development and function’’~’. Its expression is 
downregulated in the adult except in niches in direct contact with 
tissue-specific stem cells in mammary gland, bone, skin and intestine 
(Fig. 2a and Supplementary Figs 7 and 8). In tumours, POSTN is 
produced by stromal aSMA*VIM* (aSMA also known as ACTA2) 
fibroblasts according to RNA in situ hybridization, immunostaining 
and quantitative PCR (Fig. 2b-h). Notably, POSTN expression is 
induced in the lung stroma by infiltrating cancer cells (Fig. 2d), but 
does not occur in the alveolar lung tissue of tumour-bearing but 
metastasis-free animals (Supplementary Fig. 8). Human breast cancer 
patients show induction of stromal POSTN expression in 75% oflymph 
node metastases (Fig. 2i, j and Supplementary Fig. 9). Whereas POSTN- 
deficient mice maintain normal mammary gland development (Sup- 
plementary Figs 10-12), MMTV-PyMT Postn ‘~ breast cancers show 
a dramatic decrease in the number and size of pulmonary metastases, 
to less than 10% of controls, despite unaltered primary tumour size and 
morphology (n = 37, P < 0.003; Fig. 3a-d and Supplementary Figs 13- 
15). Metastasis formation from POSTN-deficient tumour cells is res- 
cued in wild-type recipients (Supplementary Figs 16 and 17), where 
expression of POSTN is at the same level as in controls (Supplementary 
Fig. 18), indicating that stromal production of POSTN determines 
metastatic efficiency. 

In searching for tumour-derived factors that can induce stromal 
POSTN, in silico promoter analysis predicted regulation mediated 
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Figure 1 | Cancer stem cells initiate metastasis. (a) Representative density 
plot showing the abundance of cancer stem cells in MMTV-PyMT breast 
tumours defined as CD24* CD90" after gating for viable (7-AAD_ ) and lin™ 
(CD31 CD45 TER119_ ) cells (not shown). b, c, CSCs are the only cells to 
form pulmonary metastases on tail vein injection. CD24*CD90* CSCs and 
non-CSC populations from GFP* tumour cells freshly isolated from primary 
tumours (b; 10° cells injected each) and metastases (c; 10° cells injected each) 
were separately injected in recipient mice. The frequency of CSCs (red; P< 0.01 
(b), P< 0.05 (c)) is maintained at the metastatic site (shown as pie charts; n = 6 
each; errors, s.e.m.). Scale bar, 1 mm. d, Time course experiments show selective 
proliferation of CSCs during metastatic colonization. GEP* tumour cells (10°) 
were intravenously injected into recipient mice. At the indicated time points, 
the total proportion of GFP* tumour cells in the lung (black line) and the 
relative amount of the CSC population (red line) were evaluated by analysis 
using fluorescence-activated cell sorting (FACS). Note the transient increase of 


by SMAD, NF-«B and LEF/TCF. Primary lung fibroblasts upregulate 
POSTN in response to TGF-B3 and TGF-2, but are not responsive to 
BMP4, Wnt3A, Wnt5A or TNF-« (Fig. 3e, Supplementary Fig. 19 and 
data not shown). Co-culture experiments revealed that tumour cells 
are sufficient to trigger POSTN production (Supplementary Fig. 20). 
Notably, CSCs and non-CSCs both produce high levels of TGF-83 
(Supplementary Fig. 21), and blocking the action of TGF-f3 by expres- 
sion of a secreted decoy receptor’* (TGFBR2ATM; see Methods) in 
tumour cells blocks POSTN expression and prevents metastasis forma- 
tion (Fig. 3f and Supplementary Fig. 22) in line with earlier results”. 
Together, these experiments demonstrate that infiltrating tumour 
cells need to educate the host stroma of the target organ to support 
metastasis initiation. 

Even small metastatic colonies are strongly diminished in POSTN- 
deficient animals (Supplementary Fig. 23). Having shown that metastatic 
colonization depends on cancer stem cells, we went on to assess a 
potential role of POSTN in stem cell maintenance. Growth under 
conditions of ultralow attachment has been used to study CSCs in 
vitro“. Surprisingly, establishment of such cancer stem cell cultures 
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CSCs after one week (n = 4 per time point; errors, s.e.m.). e, f, CSCs are the only 
cells able to initiate growth in a short-term in vivo colonization assay. e, One 
week after tail vein injection of 10° GEP~* tumour cells, mice were injected with 
BrdU. After 2h, CSCs and non-CSCs were isolated by FACS and the frequency 
of proliferating cells was evaluated by BrdU staining on cytospins (n = 12, 
P<0.05; errors, s.d.; representative example is shown on the right). Note the 
increase of proliferation in the CSC population during the early phase of lung 
colonization; by contrast, proliferation does not significantly differ between the 
two populations in the primary tumour. f, GEP* CSCs (2 X 10°) or GEP* non- 
CSCs (4 X 10°) were tail vein injected and analysed after two weeks. CD90*/ 
CD24‘ -depleted non-CSCs remained as single cells whereas CSCs were able to 
initiate growth and form metastases of different sizes (S, 30-300 cells; M, 300- 
3,000 cells; L, >3,000 cells). The upper panel shows a representative example of 
the different metastasis colonies and the results are cumulatively quantified in 
the bottom panel (n = 6). Scale bars, 20 um (e) and 250 um (f). 


is not possible using POSTN-deficient tumours (Fig. 3g-j; n = 16; 
compare with Supplementary Fig. 24, which shows unaltered cell 
survival in standard two-dimensional culture). In wild-type tumour 
spheres, POSTN is expressed by tumour-derived, stromal fibroblasts 
(Supplementary Fig. 25). Adding periostin protein to the mutant 
tumour cells rescues sphere formation (Fig. 3j). Conversely, a blocking 
antibody targeting POSTN prevents maintenance of wild-type cancer 
stem cells (Supplementary Fig. 30), but affects neither cellular survival 
nor growth per se (Supplementary Figs 26 and 27). Furthermore, CSCs 
fail to proliferate when co-cultured with POSTN-mutant, pulmonary 
fibroblasts (Fig. 3k). In lung metastases, we observe a preferential 
localization of CD90* CSCs adjacent to stromal niches (Fig. 31), 
whereas lung metastases in POSTN-deficient animals show a reduc- 
tion in the size of the CSC population (Fig. 3m). Thus, POSTN is an 
essential niche factor that supports CSC growth during metastatic 
colonization. 

To gain insight into how POSTN is related to the mechanisms that 
control stem cell maintenance, we characterized the interactome of 
POSTN by tandem affinity purification (TAP)-tag enrichment and 
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Figure 2 | POSTN is a stromal niche component that is induced on 
metastasis formation. a-c, Stromal POSTN expression as detected by RNA in 
situ hybridization (POSTN-expressing cells in blue) around terminal end buds, 
which are enriched for mammary gland stem cells during puberty (a), tumour 
cells at the primary site (b) and in pulmonary metastases (c; outlined in red). 
d, Immunohistochemistry of POSTN expression in pulmonary metastases at 
different stages. e~g, Immunofluorescence analysis of POSTN and fibroblast 
markers in pulmonary metastases. POSTN is not expressed by PyMT tumour 
cells (g), but is from aSMA* (e) and VIM" (f) fibroblasts. The white outline in 


tandem mass spectrometry (Supplementary Methods), and this pointed 
to binding of POSTN to Wnt ligands**. Using TAP-tag pull-downs, we 
confirmed interaction between POSTN and Wntl and Wnt3A, but 
found that a known inhibitory ligand of the pathway, DKK1, does not 
associate with POSTN (Fig. 4a, b and Supplementary Fig. 28). The 
interaction between POSTN and Wnt ligands boosts Wnt signalling 
activity as measured by LRP6 phosphorylation and reporter assays 
using Wntl, Wnt2 and Wnt6 (Fig. 4c, d). The Wnt pathway is known 
to control stem cell maintenance in a variety of tissues, including 
mammary gland” and tumours*’””*. Indeed, addition of Wnt3A can 
rescue in vitro stem cell expansion in the absence of POSTN (Sup- 
plementary Figs 30 and 31). Furthermore, we find Wnt signalling 
activity to be concentrated in the CSC population in vivo as analysed 
by FACS (Fig. 4e, f). Here, we use a lentiviral reporter that allows Wnt- 
inducible GFP expression and constitutive labelling of individual 
reporter-containing cells by human CD2 (Supplementary Fig. 29). 
Wnt signalling activity in metastases is abrogated in the absence of 
POSTN (Fig. 4g; see Supplementary Fig. 29 for Wnt activity in tumour 
spheres). Similarly, we detect higher levels of the general Wnt target gene 
Axin2 in metastases in wild-type hosts than in mutant hosts (Fig. 4h, i). 
Remarkably, metastasis of Wnt-driven breast cancer proves to be inde- 
pendent of stromal POSTN (Fig. 4j). Thus, POSTN acts as a niche 
component that can promote stem cell maintenance and metastatic 
colonization by augmenting Wnt signalling (Supplementary Fig. 1). 
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e defines the tumour-stroma border. h, Quantitative PCR with reverse 
transcription detects POSTN expression in CD34~CD31~ pulmonary 
fibroblasts, in CD31~* endothelial cells but not in CD45* immune cells that 
were isolated by FACS from lungs of mice with macrometastases ( = 3; errors, 
s.d.). i, j, Immunohistochemistry of POSTN expression in auxiliary lymph 
nodes of human breast cancer patients with metastasis-free NO disease (j) or 
metastatic disease (i). A total of 17 of 23 lymph node metastases were positive 
for POSTN expression. Scale bars: 20 um (a) and 2.5 im (a, inset); 50 um 

(b; same scale in c) and 10 um (b, inset); 50 tum (d); 20 um (e-g); 200 pum (i, j). 


Thus, instrumental factors that contribute to the inefficiency of the 
metastatic process are the low abundance of CSCs, which are required 
to initiate growth at the secondary site, and incompatibilities with 
‘foreign’ niches, which necessitate education of the target organ to 
resemble the primary microenvironment. Periostin is an essential 
component of this CSC-supportive niche and needs to be induced in 
the lung stroma by infiltrating tumour cells. Together with a recent 
report that implicated tumour-cell-derived tenascin C in breast cancer 
metastasis, these examples demonstrate an essential role for single 
proteins of the extracellular matrix in metastatic colonization””®. 

Surprisingly, POSTN deficiency affects neither normal mammary 
gland development nor tumour formation or the relative size of the 
CSC population at the primary site (Supplementary Fig. 32). This may 
be due to a greater complexity and redundancy in the primary niche, 
such that the loss of a single factor can be tolerated. By contrast, loss 
of only one important factor, such as POSTN, prevents metastatic 
colonization in the presumably less supportive and less complex 
secondary site. Accordingly, the early phase of metastasis can be 
anticipated to be particularly sensitive to therapeutic intervention 
because the dependence of cancer cells on niche signals is probably 
highest in that phase. Targeting this metastatic niche promises to be 
less sensitive to rapid genetic changes in cancer cells and may not only 
prevent metastatic colonization but may also interfere with the survival 
of disseminated, dormant cancer cells. 
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Figure 3 | POSTN is required for metastatic 
colonization by affecting CSC maintenance. 

a, b, Primary tumour size (a) and number of 
spontaneously formed macrometastases (b) in 
control and Postn”'~ mice (n = 37). ¢, d, HE section 
of spontaneous, pulmonary metastases in control 
(c) and POSTN-mutant (d) animals. e-f, TGF-B3 
induces POSTN expression in lung stromal cells. 

e, TGF-B3 was added to lung stromal cells to test its 
ability to induce POSTN expression. RNA 
expression by quantitative PCR and protein 
expression by western blot analysis (inset) were 
measured at the indicated concentrations. f, Blocking 
of TGF-3 abrogates metastasis formation. Cancer 
cells were infected either with a control virus or with 
a virus expressing secreted, dominant negative 
TGFBR2ATM and separately intravenously injected 
into recipient mice. Metastasis formation was 
evaluated 6 weeks after injection (n = 12; errors, 
s.d.). g, Frequency ofmammosphere formation from 
primary tumours. WT, wild type. h, Representative 
picture of control tumour mammospheres. i, Loss of 
CSCs in secondary mammosphere assays from 
Postn ‘~ tumours. j, Sphere formation by Postn™ 
tumour cells can be rescued by addition of 

50ng ml of periostin to primary cultures. Control 


j= 


tumour cells produce 13.3 + 2.5 spheres per 30,000 
cells. Quantification is shown in the histogram on the 
right (n = 4; errors, s.d.). k, Only stromal cells 
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8 1.0 from wild-type or Postn ‘~ mice. Colony formation 
8 was evaluated after two weeks (n = 3; errors, s.d.). 
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CSCs in rare pulmonary metastases in POSTN- 
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Figure 4 | POSTN promotes stem cell maintenance and metastasis 
formation by augmenting Wnt signalling. a, Interaction of TAP-tagged 
POSTN and Wnt3A and Wnt1 as determined by pull down and western 
blotting. b, Interaction of TAP-tagged POSTN and DKK1 as determined by 
pull down and western blotting. Note the absence of interaction between 
POSTN and DKK1. c, LRP6 phosphorylation (pLRP) increases on POSTN 
expression. d, Wnt reporter activity in response to Wnt1, Wnt2 or Wnt6 is 
stimulated by POSTN (n = 3; errors, s.d.). e, f, In vivo Wnt reporter assay. 
Tumour cells were infected with a lentivirus containing both an artificial Wnt- 
responsive promoter driving GFP expression and a constitutive promoter 
expressing human CD2 (Supplementary Fig. 29). Cells were analysed by FACS 
considering only human-CD2", reporter-containing cells. CD90*CD24* 
CSCs and non-CSCs were scored for Wnt reporter activity as GFP". The signal 
concentrates in CSCs in both primary tumours (e) and pulmonary metastases 
(f) (n = 3; errors, s.e.m.). g, CbD90* CD24* wild-type CSCs containing a Wnt 
reporter (human CD2™) show signalling activity (GEP~) only when seeded in 
wild-type as opposed to Postn/— lungs ( = 3; errors, s.e.m.). 

h, i, Immunofluorescence staining of AXIN2 in control (h) and POSTN- 
deficient (i) lung metastases (outlined in white). Note the absence of staining in 
POSTN lung metastases. Scale bar, 10 ttm. j, MMTV-Wnt1 tumour cells can 
form pulmonary metastases in the absence of POSTN expression in the stroma. 
Tumour cells were orthotopically transplanted into mammary glands of wild- 
type or POSTN-deficient recipients and analysed after ten weeks for 
spontaneous lung metastasis (n = 5 each; errors, s.d.). 
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METHODS SUMMARY 


Details on the mouse models used and the generation of the Postn gene ablation 
can be found in Supplementary Information. Tumours derived from MMTV- 
PyMT mice were collected and cells were isolated and directly stained for FACS 
analysis or sorting using antibodies against CD31, CD45, TER119 (LY76), CD90, 
CD24 and CD49f (ITGA6). FACS-isolated tumour cells were directly transplanted 
into recipient mice. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Mouse work. The design of the POSTN targeting vector is shown in 
Supplementary Fig. 10. E14.1 129/Ola ES cells were used for targeting and were 
injected into C57Bl6 blastocysts. The POSTN mutant allele was then backcrossed 
onto the FVB background for at least eight generations. The MMTV-PyMT", 
MMTV-Wntl” and the ACT-GFP*’ mouse strains have been described previ- 
ously. All used strains were backcrossed onto FVB for ten generations except for 
the Nude allele, which was used on a NMRI background. For tumour cell trans- 
plantations, Nude or Rag ~'~ mice were used when tumour cells expressed GFP or 
had been modified by lentiviral vectors. In general, we did not observe major 
differences in these different strains with respect to cancer stem cell phenotype, 
tumour take rate or metastasis number, except for an overall faster growth in 
Ragl ~'~ versus Nude versus wild type. For tail vein injections, tumour cells were 
resuspended in 100 pl of PBS. For orthotopic transplantations, tumour cells were 
resuspended in 50 pl growth-factor-reduced matrigel (BD Biosciences) and trans- 
planted into a small pocket within the fourth mammary fat pad. All animal 
procedures were performed in accordance with the Swiss legislation on animal 
experimentation. 

Antibodies. We used TER119, CD24, CD29, CD34, CD45 and CD90.1 
(eBioscience); CD31 and GFP (Invitrogen); Cytokeratin 8, POSTN, AXIN2 and 
Wntl (Abcam); POSTN (Adipogen); Cytokeratin 14 (Covance); GATA-3 and 
PyMT (Santa Cruz Biotechnology); «SMA (Sigma); VIM (Lifespan Biosciences); 
and BrdU (BD Pharmingen and ImmunologicalsDirect). The antibody against 
human CD2 was a kind gift by Professor Werner Held (Ludwig Center for Cancer 
Research, Lausanne). 

Tumour and metastasis evaluation. For the quantification of spontaneous 
micrometastasis, lungs were completely sectioned and HE stained, and the size 
and the number of metastatic nodules were analysed by microscopy. 
Quantification of lung metastases derived from GFP* tumour cells was performed 
by fluorescent microscopy (Leica M205 FA stereomicroscope) or by FACS analysis 
of cell preparations from total lungs. All statistical evaluation used a homoscedastic 
Student’s t-test with a two-tailed distribution. 

For expression of a secreted TGFB decoy receptor (TGFBR2ATM), tumour cells 
were transduced with lentiviral vectors expressing the extracellular domain 
(amino acids 1-185) of murine TGFBR2 (NM_009371.3) under the control of a 
murine phosphoglycerate kinase (PGK) promoter, and metastases were scored as 
described above following tail vein injection of transduced tumour cells. 
Tumour cell isolation. Tumours or lungs containing pulmonary metastases were 
minced with a razor blade and digested with a mixture of DNase and Liberase 
(Roche Diagnostics). On enzymatic digestion, samples were passed through a 
100-1m filter and washed once in growth medium (DMEM/F12 with 2% FBS, 
20ng ml ' EGF, 10 pg ml’ insulin; Invitrogen) and twice in PBS. Cells were then 
directly used for FACS staining and analysed in a Cyan ADP flow cytometer 
(Beckman Coulter) or sorted with a custom-built FACSAria II (Beckton 
Dickinson). For tail vein injection, cells were plated overnight on collagen. In 
the experiments evaluating the metastasis initiation potential of CSCs versus 
non-CSCs, we were able to prepare and inject higher cell numbers from metastases 
than from primary tumours, where larger necrotic areas limit the obtainable cell 
amounts. In both settings, only CSCs were able to give rise to pulmonary 
metastases, but this experimental setting is unsuitable for a direct comparison of 
the metastatic potential of CSCs isolated from primary tumours with that of those 
isolated from metastases. In the extravasation experiments, we performed several 
pilot experiments perfusing the lungs before dissection to distinguish cells trapped 
inside capillaries from those that successfully extravasated. Perfusion was done 
with Ringer’s buffer through the right ventricle for 5-10 min. 

Tumour mammosphere cultures. Sphere cultures were established from total 
tumour cell preparations. After dissociation, tumour cells were plated on collagen 
overnight, trypsinized and plated in 100 il of sphere media (DMEM/F12 with B27, 
20ngml EGE, 20ngml 'FGFand4 pg ml * heparin) into 96-well, low-attach- 
ment plates (Constar) at a concentration of (1-3) x 10° cells per well. For the anti- 
POSTN blocking antibody assay, half of the culture medium was replaced by 
hybridoma supernatant. Secondary mammosphere culture was performed by col- 
lecting the spheres through gentle centrifugation (130g) followed by enzymatic 
(10 min in 0.05% trypsin-EDTA at 37 °C) and mechanical dissociation using a 25G 
needle. Immunostainings were performed as whole mounts of PFA-fixed and 0.1% 
Triton-X100-permeabilized spheres and were analysed by confocal microscopy. 

Identification of POSTN as a niche-expressed factor. To identify niche- 
expressed factors, we performed microarray RNA profiling studies where we 
analysed laser-capture microdissected material from normal stem cells together 
with their adjacent stroma. This was compared with laser-capture microdissected 
material isolated from differentiated cells together with their adjacent stroma from 
the same tissue (data not shown). Skin and intestine contain stem cells in mor- 
phologically distinct structures (the bulge in the skin, the crypt in the intestine). On 


the basis of these profiles (accessible through GEO GSE31730 and 8818), 
we evaluated potential niche localization of candidate genes by RNA in situ 
hybridization and immunofluorescence analysis (Fig. 2a, Supplementary Fig. 7 
and data not shown). Moreover, we profiled the candidate genes by quantitative 
PCR with reverse transcription and performed RNA in situ hybridization in other 
tissues, in tumours and in metastases (data not shown). This identified POSTN as 
a stromal marker in primary murine breast cancer and its metastases (Fig. 2). 
Immunodetection and in situ hybridization. Immunostaining was performed 
on 7-11m OCT- or gelatin-embedded (to maintain GFP signals) frozen sections 
permeabilized with 0.1% Triton-X100 or on 5-km paraffin sections using antigen 
retrieval for 20 min in boiling, 10 mM citric acid, pH 6.0. For BrdU immunostain- 
ing, an additional incubation period of 10 min in 2N HCl was performed. For 
immunohistochemistry, endogenous peroxidases were blocked by incubation with 
0.6% H,O, in methanol. After incubation with the indicated antibodies, secondary 
fluorescently labelled antibodies Alexa Fluor 488, 567 and 647 (Molecular Probes, 
Invitrogen) or HRP-conjugated secondary antibodies were used to reveal the 
primary antibodies. For immunohistochemistry, Dako Envision was used 
together with diaminobenzidine tablets (Sigma) to detect HRP. Fluorescent 
images were made with an automated upright microscope system (DM5500, 
Leica) or an LSM700 upright or inverted confocal microscope (Zeiss). Light 
images were made with an AX70 widefield microscope (Olympus). Figures were 
generated with PHOTOSHOP (Adobe Systems) and CYTOSKETCH (CytoCode). 
In situ hybridizations were performed on paraffin sections permeabilized by 
proteinase K digestion using DIG-labelled (Roche), antisense transcripts of the 
mouse POSTN complementary DNA. After hybridization, section were washed in 
50% formamide, <5 SSC (0.75 M NaCl, 85 mM sodium citrate, pH 7.0) and 0.1% 
Tween at 70 °C, and non-specifically bound probe was digested by RnaseA treat- 
ment, followed by extensive washing in 50% formamide, <2 SSC and 0.1% Tween 
at 65°C. The DIG label was detected by an anti-DIG Fab (Roche) coupled to 
alkaline phosphatase using FAST NBT/BCIP (Sigma). 
Western blot. Protein was extracted with complete RIPA buffer, separated by 
electrophoresis, transferred to PVDF membranes, blocked with 5% dry milk or 
BSA and incubated overnight with primary antibodies. Immunoreactive bands 
were visualized using HRP-conjugated secondary antibodies (Promega) and the 
detection reagent SuperSignal West Cheminoluminiscent Substrate (Thermo 
Scientific). 
Real-time RT-PCR. RNA was prepared using a miniRNA or microRNA kit 
(Qiagen) from FACS-isolated primary cells. Complementary DNAs were generated 
using oligo-T priming and quantitative PCR was performed in a LightCycler 
(Roche) or a StepOnePlus thermocycler (Applied Biosystems) using the Power 
SYBR green PCR Master Mix (Applied Biosystems) and the primers aatgctgccctggc 
tatatg and gcatgacccttttccttcaa (POSTN); agcgcagacacaacccatag and gggttctgccca 
catagtacaag (TGF-B3); and caagctcatttcctggtatgacaat and gttgggatagggcctctcttg 
(GAPDH). 
Periostin monoclonal antibody production. In collaboration with Adipogen, we 
injected Postn ‘~ mice with purified human POSTN protein. Hybridomas were 
generated and tested for antibody production by ELISA. Antibodies were selected 
for their ability to recognize POSTN and for their ability to block tumour mammo- 
sphere formation in vitro. 
Periostin interactome and pull-downs. Freshly isolated tumour cells were grown 
under mammosphere conditions in the presence of purified, polyHis-tagged and 
streptavidin binding peptide (SBP)-tagged POSTN protein (Supplementary Fig. 28) 
for 72h. Then proteins were crosslinked by 1 mM DTSSP (Pierce) for 10 min and 
solubilized by DDM and SDS (Anatrace). Protein purification was achieved by two 
successive affinity purification steps using the SBP and the polyHis tags and elution 
by biotin and imidazole, respectively. After elution, the crosslink was released by 
DTT treatment, the resulting protein mixture was trypsin-digested and peptides 
were identified by one-dimensional nano-liquid chromatography tandem mass 
spectrometry. Among the POSTN-attached proteins, we found Wnt6 to be a 
potential interactor. Because these experiments involved crosslinkers, we consider 
these results to be preliminary unless confirmed by independent experiments. 
For Wnt6, an interaction with POSTN could not be validated owing to a lack of 
Wnt6 antibodies and a suitable expression system for Wnt6. Nevertheless, this 
preliminary finding prompted us to test directly for interactions of POSTN with 
other Wnt ligands where the necessary tools are available. For these pull-downs, 
cDNAs encoding Wnt1, HA-tagged Wnt3A, DKK] and SBP/His-tagged POSTN 
(Supplementary Fig. 28) or SBP/His-tagged mouse immunoglobulin-G heavy 
chain (as a control) were transfected separately into 293T cells. Cells secreting 
POSTN or immunoglobulin-G and cells producing Wnt or DKK were mixed after 
one day and collected for pull-downs after three additional days. Without the use of 
crosslinkers, POSTN-interacting proteins were enriched by the SBP/His tag as 
described above using NP40 as detergent. Western blot was performed for iden- 
tification of the pulled-down proteins. 
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Wnt reporter activity. HEK293T cells were transduced with a lentiviral vector 
containing a LEF/TCF-responsive luciferase reporter consisting of a minimal 
promoter and 20 copies of the LEF/TCF consensus site or a control construct in 
which the LEF/TCE sites had been mutated**. These stable Wnt reporter cell lines 
were then transiently transfected with a POSTN expression vector before these 
cells were mixed with transiently transfected HEK293T or COS7 cells (the latter 
for Wnt6 expression) producing Wnt1, Wnt2 or Wnt6, respectively. The media on 
these co-cultures of Wnt reporter and Wnt producer cells were replaced after 24h 
to remove soluble Wnt ligands. After 48h of co-culture, luciferase activity in cell 
extracts was measured with a kit from Promega. 

For in vivo reporter assays, we generated a lentiviral vector containing the same 
transcription factor sites and minimal promoter as described above driving GFP 
expression. In addition, this vector contained a ubiquitous PGK promoter driving 
expression ofa truncated human CD2 cDNA that allows identification of reporter- 
containing cells by FACS (Supplementary Fig. 29). Lentiviruses were generated in 
293T cells using third-generation lentivirus packaging vectors, and virus particles 
were concentrated by ultracentrifugation. Freshly isolated tumour cells were 
infected with lentiviruses overnight, enriched by anti-CD2 MACS (Miltenyi) for 
cells carrying the reporter construct and orthotopically transplanted for tumour 
formation and spontaneous metastasis. Wnt signalling activity in cells of primary 
tumours or spontaneous pulmonary metastases was quantified by FACS analysis by 
measuring GFP* cells after gating for human-CD2‘" cells. Similarly, human-CD2~ 
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tumour cells were used for tumour mammosphere cultures to measure Wnt path- 
way activity in the presence or absence of the POSTN blocking antibody. 
Isolation of lung stromal cells. Lungs were minced with a razor blade and 
digested with a mixture of DNase and Liberase (Roche Diagnostics). On enzymatic 
digestion, samples were passed through a 100-,1m filter and washed once in growth 
medium (DMEM/F12 with 2% FBS, 20ng ml! EGF and 10 uml 1 insulin; 
Invitrogen) and twice in PBS. Cells were plated onto plates pretreated with collagen 
and allowed to attach for three days. The medium was then changed to serum-free 
medium and tumour cells or soluble factors were added to test POSTN induction. 
POSTN expression was evaluated by western blot, PCR with reverse transcription 
or immunofluorescence analysis. 
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Reversal of cocaine-evoked synaptic potentiation 
resets drug-induced adaptive behaviour 


Vincent Pascoli!, Marc Turiault! & Christian Liischer’? 


Drug-evoked synaptic plasticity is observed at many synapses and 
may underlie behavioural adaptations in addiction’. Mechanistic 
investigations start with the identification of the molecular drug 
targets. Cocaine, for example, exerts its reinforcing’ and early 
neuroadaptive effects’ by inhibiting the dopamine transporter, 
thus causing a strong increase in mesolimbic dopamine. Among 
the many signalling pathways subsequently engaged, phosphoryla- 
tion of the extracellular signal-regulated kinase (ERK) in the nucleus 
accumbens’ is of particular interest because it has been implicated in 
NMDaA-receptor and type 1 dopamine (D1)-receptor-dependent 
synaptic potentiation’ as well as in several behavioural adapta- 
tions®*. A causal link between drug-evoked plasticity at identified 
synapses and behavioural adaptations, however, is missing, and the 
benefits of restoring baseline transmission have yet to be demon- 
strated. Here we find that cocaine potentiates excitatory trans- 
mission in D1-receptor-expressing medium-sized spiny neurons 
(D1R-MSNs) in mice via ERK signalling with a time course that 
parallels locomotor sensitization. Depotentiation of cortical nucleus 
accumbens inputs by optogenetic stimulation in vivo efficiently 
restored normal transmission and abolished cocaine-induced 
locomotor sensitization. These findings establish synaptic potentia- 
tion selectively in D1R-MSNs as a mechanism underlying a core 
component of addiction, probably by creating an imbalance 
between distinct populations of MSNs in the nucleus accumbens. 
Our data also provide proof of principle that reversal of 
cocaine-evoked synaptic plasticity can treat behavioural alterations 
caused by addictive drugs and may inspire novel therapeutic 
approaches involving deep brain stimulation or transcranial 
magnetic stimulation. 

We first tested whether cocaine treatment interfered with activity- 
dependent long-term potentiation (LTP) in the nucleus accumbens. 
When excitatory afferents onto MSNs were challenged with a high- 
frequency stimulation (HFS) train, LTP of the excitatory postsynaptic 
currents (EPSCs) was observed (Fig. 1a). The magnitude of the LTP 
was halved in brain slices from mice that had received a single injection 
of cocaine 7 days before the recording. If the cocaine was injected a 
month before measuring synaptic plasticity ex vivo, the difference was 
no longer present (Fig. 1b). MSNs of the nucleus accumbens fall into 
two classes of about equal proportions defined by the type of dopamine 
receptor expressed, with a small fraction of neurons (6-17%) that 
express both receptors’. A possible explanation for the partial change 
in LTP magnitude is therefore that cocaine exposure abolishes plasticity 
selectively in one class. To test this, we attempted LTP induction ex vivo 
after cocaine exposure in bacterial artificial chromosome (BAC) trans- 
genic mice expressing enhanced green fluorescent protein (eGFP) 
either in D1 receptor (DIR)- or D2R-MSNs. We identified D1R- 
MSNs by a crossover strategy in which we recorded from green cells 
in drdla-eGFP mice and non-green cells in drd2-eGFP mice (and vice 
versa for D2R-MSNs). Because the two approaches to identify the cell 
type yielded very similar results, we pooled the data (Fig. 1c, d). The 
main finding of this first experiment was that HFS, which reliably 


induced LTP in both types of MSNs after saline injection, became 
inefficient after cocaine treatment in D1R-MSNs (Supplementary 
Fig. la-d). The D2R overexpression recently reported in drd2-eGFP 
mice” interferes with neither the reported synaptic effects nor with the 
acute locomotor response to cocaine (Supplementary Fig. 2). After a 
cocaine injection, we were unable to induce HFS LTP regardless of 
whether we recorded from MSNs in the shell or the core of the nucleus 
accumbens (Supplementary Fig. 3), but because MSNs in the shell are 
the immediate targets of the dopamine neurons of the medial ventral 
tegmental area, which undergo the most significant changes in res- 
ponse toa single cocaine injection"', we focused on nucleus accumbens 
shell neurons in the present study. Without distinguishing between 
D1R- and D2R-MSNs, several reports have already suggested that 
drug-evoked synaptic plasticity in the nucleus accumbens may 
underlie drug-related behavioural adaptations (reviewed in ref. 12). 
For example, when 5-7 daily cocaine injections were followed by a 
10-21-day withdrawal, an overall increase of the AMPA/NMDA 
receptor (AMPAR/NMDAR) ratio” or GluA1/2 surface expression’**° 
was observed, and both observations were reversed by a challenge 
injection of cocaine. 

The failure of HFS to induce LTP selectively in D1R-MSNs after 
cocaine treatment may be due either to the impairment of LTP induc- 
tion or the occlusion of LTP expression. To distinguish between the 
two scenarios, we recorded miniature EPSCs (mEPSCs) in both cell 
types (Fig. le and Supplementary Fig. 4a) and observed a significant 
increase in amplitude along with a modest change in frequency of 
unitary events in DIR-MSNs (whereas these parameters remained 
unchanged in D2R-MSNs). Given that the paired pulse ratio also 
remained unchanged (Fig. 1f and Supplementary Fig. 4b), a post- 
synaptic mechanism underlying the increase of transmission and 
hence an occlusion scenario is the most likely explanation. 

We characterized the induction criteria for HFS LTP in MSNs in 
vitro and found that it depends on NMDAR activation (Fig. 2a). 
Applying the MEK (MAP ERK kinase) inhibitor U0126 for the dura- 
tion of the induction protocol also led to a complete block (Fig. 2b), 
indicating that activation of the ERK pathway is essential for this form 
of LTP, akin to NMDA-dependent LTP in the hippocampus and the 
dorsal striatum'*””, the latter also depending on D1Rs"*. 

Given that our results indicate an occlusion of LTP, we hypothesized 
that cocaine could drive synaptic potentiation selectively in D1R- 
MSNs via ERK activity. Indeed, a sharp increase in phosphorylated 
ERK is detected soon after cocaine exposure in DIR-MSNs, but not in 
D2R-MSNs”. In the nucleus ERK modulates gene expression, whereas 
in dendrites ERK is probably involved in the regulation of activity- 
dependent spine dynamics, synaptic glutamate receptor insertion and 
local dendritic protein synthesis’. To provide in vivo evidence for ERK 
dependence of cocaine-evoked synaptic plasticity, we treated mice 
with $L327, a blood-brain barrier penetrant ERK pathway inhibitor, 
before the saline or cocaine injection. We found that this manipula- 
tion rescued HFS LTP in D1R-MSNs one week later (Fig. 2c, d) 
without modification of the acute locomotor response to cocaine 
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(Supplementary Fig. 5). A crossover control design with the two mouse 
lines again showed no difference between the green cells of one line and 
the non-green cells of the other (Supplementary Fig. 6). 

Because inhibition of the ERK pathway blocks locomotor sensitiza- 
tion to cocaine”, and we found LTP in D1R-MSNs to be dependent 
on ERK, we reasoned that cocaine-evoked potentiation might be a 
cellular correlate of the behavioural adaptation. Moreover, ERK activa- 
tion is correlated both with AMPAR expression at the cell surface 
throughout the nucleus accumbens and with locomotor sensitization”, 
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Figure 1 | Cocaine disrupts HFS-induced LTP in DIR-MSNs of the nucleus 
accumbens by potentiation of excitatory afferents. a-d, Graphs show 
normalized EPSCs as a function of time and the overlay of averaged (20 trials) 
traces of AMPAR EPSCs before (black line) and after (grey line) HFS. Symbols 
represent average of 6 trials. a, One week after cocaine injection, LTP was 
halved (224 + 17.9% to 149 + 14.7%, Student’s t-test ty; = 3.06). b, After 1 
month, no difference between treatments was detected (204 + 26.1% to 

206 + 21.3%, t2; = 0.07). ¢, Using drdla- and drd2-eGFP mice, HFS reliably 
induced LTP in DIR-MSNs and D2R-MSNsafter saline injection (206 + 16.0% 
and 229 + 24.1%, respectively, t7 = 0.78). d, Cocaine abolished HFS LTP in 
D1R-MSNs but not in D2R-MSNs (115 + 8.5% and 239 + 12.7%, respectively, 
tz9 = 7.98). e, Sample traces, cumulative probability and mean values of 
amplitude and frequency for mEPSCs recorded from D1R-MSNs one week 
after injection of saline or cocaine. Cocaine significantly increases mean 
amplitude of mEPSCs in DIR-MSNs (t;4 = 4.68; Kolmogorov-Smirnov test: 
P<0.01) whereas the frequency was unaffected (t,4 = 1.635; Kolmogorov- 
Smirnov: P > 0.05). f, Paired-pulse ratio (PPR, 50 ms inter-stimulus interval) 
measured in D1R-MSNs was not different a week after saline (Sal) or cocaine 
(Coc) injection (1.7 + 0.11 and 1.4 + 0.1, respectively, t34 = 1.73). Scale bars: 
20 ms and 20 pA. n = 8-30, *P < 0.05. Error bars show s.e.m. 
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but the synapses involved have not been identified, and no causal link 
has been established. 

We then confirmed that a single injection of cocaine was sufficient 
to cause locomotor sensitization to a second injection of the same 
dose’. This was the case when challenged a week but not a month 
after the initial cocaine injection (Supplementary Fig. 7). The beha- 
vioural alteration therefore followed a time course similar to the 
cocaine-evoked synaptic potentiation (Fig. la, b). If cocaine-evoked 
potentiation is causally involved in locomotor sensitization, then 
depotentiating these synapses may reverse the behavioural alterations. 
To test this prediction experimentally, we injected channelrhodopsin 
(ChR2)-expressing adeno-associated virus (AAV) into the infralimbic 
cortex and implanted light guides into the nucleus accumbens to be 
able to selectively activate in vivo the terminals of this major excitatory 
input. Histological verification of the injection site did confirm robust 
expression in the infralimbic cortex, along with sparser expression in 
the prelimbic cortex (Fig. 3a and Supplementary Fig. 8). To validate 
further this approach, we recorded photocurrents from infected 
cortical neurons and prepared slices of the nucleus accumbens shell 
in which wide-field light exposure led to robust AMPAR-mediated 
EPSCs (Fig. 3b). We next applied light pulses at 1 Hz for 10 min (an 
established long-term depression (LTD) protocol to reduce synaptic 
transmission, see Methods). We found that this protocol strongly 
depressed transmission in nucleus accumbens slices from both saline 
and cocaine-treated mice (Fig. 3c). Interestingly, the magnitude of the 
depression was significantly larger in the latter, in line with an efficient 
depotentiation added to the LTD. As the LTD/depotentiation was 
NMDAR dependent and occurred without change of the paired pulse 
ratio, it is probably mediated by a postsynaptic expression mechanism 
and therefore constitutes an actual reversal of cocaine-evoked poten- 
tiation (Supplementary Fig. 9). 
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Figure 2 | HFS LTP and cocaine-evoked potentiation both depend on ERK 
activation. a, HFS LTP was blocked by an NMDAR antagonist (AP5, 100 |1M; 
control, 209 + 15% versus AP5, 93 + 9%, ty4 = 6.78). b, Bath application of the 
MEK inhibitor (U0126, 5 1M, 15 min before until 10 min after HFS protocol) 
also blocked LTP (control, 234 + 17.8% versus U0126, 109 + 16.2%, 

t13 = 5.047). c, Intraperitoneal administration of the MEK inhibitor (SL237, 
40 mgkg ') in drdla-eGFP mice or drd2-eGEP mice 1h before saline did not 
modify the magnitude of HFS LTP in D1R- or D2R-MSNs when assessed one 
week later (224 + 31% and 221 + 27%, respectively, tig = 0.09). d, SL237, 
administered 1 h before cocaine restored HFS LTP in DIR-MSNs (200 + 22% 
versus 232 + 27% for D2R-MSNs, ty = 0.93). Scale bars: 10 ms and 20 pA. 
n= 6-13, *P< 0.05. Error bars show s.e.m. 
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Figure 3 | Reversal of cocaine-evoked potentiation abolishes locomotor 
sensitization to cocaine. a, Schematic illustration of the site infected with AAV 
coding for eGFP-ChR2 and of the bilateral cannula implantation with optic 
fibres inserted in the nucleus accumbens shell. Insets show AAV-eGFP-ChR2 
expression in the infralimbic cortex and high-resolution image of axonal inputs 
onto nucleus accumbens shell neurons (blue stains nuclei, X60 magnification). 
b, Averaged current traces in response to light pulses (470 nm) of 4 ms and 
100 ms in infected cells in slices of the infralimbic medial prefrontal cortex 
(mPFC; top). In nucleus accumbens (NAc) slices, light pulses (4 ms) evoked 
AMPAR EPSCs (blocked by NBQX 20 1M, green trace, bottom). c, Low- 
frequency stimulation (light pulses of 4 ms at 1 Hz for 10 min) induced 
depression of AMPAR EPSCs evoked with light in nucleus accumbens slices 
from mice injected with saline or cocaine one week previously (48 + 5.7% and 
27 + 4.5%, respectively, t)4 = 2.851). n = 8, *P < 0.05. d, Quarter turns done by 
mice in the circular corridor for 60 min after injection represent locomotion 
(scatter plots of individual score and bars of mean + s.e.m. are shown). 
Multiple-way repeated-measures analysis of variance for matching data 
yielded: interaction between day, virus and treatment F(;35) = 13.41, P< 0.001; 
effect of day Fi;,35) = 78.16, P< 0.001; effect of virus F(1,35) = 17.78, P< 0.001; 
effect of treatment Fi, 35) = 88.65, P < 0.001. Wilcoxon or t-test: *P < 0.001 for 


We next applied the above-validated protocol in vivo with the goal 
of establishing a causal link between cocaine-evoked plasticity and 
behavioural sensitization. We placed light guides into the ventral 
striatum, past the nucleus accumbens core, thus preferentially aiming 
at the principal cortical input onto MSNs of the nucleus accumbens 
shell, that is, axons that have their origin in the infralimbic cortex 
(Supplementary Fig. 8). When freely moving mice were treated with 
the optogenetic depotentiation protocol 45 min before the injection of 
the cocaine challenge at day 8, locomotor sensitization was completely 
erased. In control experiments, light stimulation did not affect the 
locomotor response at day 8 when the first injection was saline instead 
of cocaine or when a control virus was used (Fig. 3d). To ensure that 
the light stimulation restored normal transmission, we recorded ex 
vivo mEPSCs and found that an effective light treatment significantly 
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cocaine versus saline pre-treatment on day 8 in AAV-control and P = 0.203 
cocaine versus saline pre-treatment on day 8 in AAV-ChR2 infected mice; 
**P = 0.0015 for locomotor response to cocaine on day 8 for AAV-ChR2 
versus AAV-control infected mice). No effect of light protocol on locomotor 
response to cocaine on day 8 when mice receive saline on day 1 was detected 
(P> 0.05 AAV-ChR2 versus AAV control). The sensitization index (see 
Methods) was significantly reduced in AAV-ChR2 infected mice treated with 
the light stimulation protocol (4 ms pulse, 1 Hz, 10 min) 45 min before the 
cocaine injection on day 8 (t9 = 3.47).n = 8-12, *P < 0.05. NS, not significant. 
e, Acute slices of cocaine- and laser-treated AAV-ChR2 or AAV-control 
infected mice were prepared on day 8 (45 min after light stimulation). Sample 
traces, cumulative probability and mean values of amplitude and frequency for 
mEPSCs recorded from D1R-MSNs are shown. mEPSC amplitude was 
decreased in AAV-ChR2 compared to AAV-control infected mice (t)7 = 5.08; 
Kolmogorov-Smirnov: P < 0.001). mEPSC frequency was unaltered 

(ti7 = 1,45; Kolmogorov-Smirnov: P > 0.05). Scale bars: 200 ms and 20 pA. 
n= 9-10, *P < 0.05. f, The light stimulation protocol restored HFS LTP in 
D1R-MSNs from AAV-ChR2 but not AAV-control infected mice 

(191.0 + 21.2% and 93.0 + 9.1%, respectively, ty). = 4.26). Scale bars: 10 ms and 
20 pA. n = 7, *P <0.05. Error bars show s.e.m. 


reduced their mean amplitude in DIR-MSNs (Fig. 3e) to a level com- 
parable to baseline transmission (Fig. le) without any effect on paired 
pulse ratio (Supplementary Fig. 10). In contrast, the mEPSC amplitude 
and frequency were not modified in D2R-MSNs after cocaine (Sup- 
plementary Fig. 11). Lastly, after cocaine treatment and optogenetic 
depotentiation the HFS protocol resulted in LTP ex vivo (Fig. 3f), 
confirming the restoration of baseline transmission in D1R-MSNs. 
Behavioural sensitization typically refers to the observation of 
increased locomotor responses with repeated injections of cocaine. 
However, sensitization becomes apparent already after the second 
injection and is best observed about a week later” (Fig. 3). Although 
certainly not sufficient to induce addiction, such early forms of drug- 
induced adaptations are considered permissive building blocks for 
more definite behavioural alterations. If this is the case, the synaptic 
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potentiation onto D1R-MSNs should still be observed after chronic 
cocaine exposure, for example after 5 daily injections followed by a 
10-day withdrawal. To confirm this hypothesis we recorded mEPSCs 
at the end of this protocol, as well as 1h and 24h after a challenge 
injection (Supplementary Fig. 12). This experiment confirmed that 
excitatory synapses onto DIR-MSNs were selectively potentiated at 
the end of the withdrawal period. 

We then tested whether, after this chronic exposure to cocaine, light 
stimulation was still effective in reversing locomotor sensitization. 
After the 5 days of injection we imposed 10 days of withdrawal, and 
treated the mice with light stimulation 45min before injecting a 
challenge dose. This completely reversed locomotor sensitization 
(Fig. 4a). When a second challenge injection was administered 24h 
later, sensitized responses were again only observed in control mice. As 
a control, we did not observe an acute effect on locomotor behaviour 
during light stimulation (Supplementary Fig. 14). 

Lastly, to estimate how long the effect of the light stimulation lasted, 
we tested for behavioural sensitization 5 days after the intervention 
(Fig. 4b). No significant locomotor sensitization was observed even when 
the mice were challenged at this extended time after light stimulation. 

Our results identify NMDAR- and ERK-dependent LTP in D1R- 
MSNs of the nucleus accumbens as a form of synaptic plasticity required 
for locomotor sensitization to cocaine. ERK activation probably consti- 
tutes a general feature of addictive drugs, because in all brain regions that 
receive dopamine inputs tetrahydrocannabinol, amphetamines, 
morphine and nicotine also activate ERK signalling’. Through 
cocaine-driven ERK phosphorylation, potentiation is induced selec- 
tively in DIR-MSNs, which leads to the occlusion of HFS-driven LTP. 
These findings are in line with observations that ERK activation may 
control AMPAR trafficking directly, an effect that may also be 
maintained over days through activation of ERK nuclear targets lead- 
ing to gene regulation”. 
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Figure 4 | Optogenetic depotentiation resets behavioural sensitization 
induced by chronic cocaine injections. a, After 5 daily cocaine injections a 
robust locomotor sensitization is measured in AAV-control and AAV-ChR2 
infected mice. The light stimulation protocol (45 min before a challenge 
injection of cocaine on day 15) abolished the locomotor sensitized response in 
AAV-ChR2 infected mice (one-way analysis of variance: effect of treatment 
F2,41) = 88.65, P< 0.001. Post-hoc comparison by Bonferroni test yielded: 
*P <0.001 for cocaine versus saline pre-treatment in AAV control; 

**P < 0.001 for AAV-ChR2 versus AAV-control infected mice, pre-treated 
with cocaine (m = 13-15). b, Light stimulation protocol applied on day 10 still 
reduced the locomotor sensitized response to a cocaine challenge injection on 
day 15 (one-way analysis of variance: effect of treatment Fiz 45) = 15.31, 
P<0.001. Post-hoc comparison by Bonferroni test yielded: *P < 0.001 for 
cocaine versus saline pre-treatment in AAV-control infected mice; **P = 0.003 
for AAV-ChR2 versus AAV-control infected mice, pre-treated with cocaine. 
No sensitization in AAV-ChR2 (P = 0.27, compared to saline pre-treatment; 
n= 11-21). Error bars show s.e.m. 
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We have explored and gained insight into the molecular mechan- 
isms of synaptic adaptations to develop a strategy for reversal of 
cocaine-evoked potentiation with the goal of normalizing behaviour. 
We chose an optogenetic depotientiaton of inputs from the infralimbic 
cortex to the nucleus accumbens shell because of the strong anatomical 
connection and the functional implication of this projection in 
cocaine-seeking behaviour**”. It is an appealing idea that behavioural 
adaptation, which closely reflects the potentiation of excitatory trans- 
mission onto DIR-MSNs, is due to an imbalance of the two classes of 
MSNs”. 

Several studies have already reported that pharmacological and 
molecular manipulations of key players in synaptic plasticity in the 
nucleus accumbens can affect adaptive behaviours associated with 
addictive drug exposure. For example, the inhibition of calcium- 
permeable AMPARs, a hallmark of late-stage cocaine-evoked synaptic 
plasticity in the nucleus accumbens” and the viral expression of a 
peptide that impairs GluA1 trafficking’*, reduce cue-induced cocaine 
seeking and cocaine-primed reinstatement, respectively. 

We provide proof of principle that optogenetic manipulations can 
be used to reverse cocaine-evoked synaptic plasticity and thus abolish 
locomotor sensitization. Although light stimulation fully resets 
locomotor behaviour, sensitization begins to reappear after a few days, 
suggesting that several treatment sessions may be required to obtain 
long-lasting effects. This is not surprising, as chronic cocaine exposure 
also induces a number of additional adaptive changes including struc- 
tural remodelling (for example, increase in spines”) and alterations of 
gene expression”. 

Sensitization to cocaine-associated stimuli has been linked with 
incentive saliency’ and may explain the exceptionally strong motiva- 
tion of addicts to obtain the drug. With chronic use, early adaptive 
changes such as those described here may build up to enhance craving 
during cocaine withdrawal’. Successful interventions that reverse 
these changes in animal models could inspire novel treatments for 
human addiction, a disease with a high social burden. Indeed, novel 
protocols of deep brain stimulation or transcranial magnetic stimu- 
lation may induce forms of synaptic plasticity that reverse drug-evoked 
adaptations, thus curbing the risk of relapse. 


METHODS SUMMARY 


All experiments were reviewed by the institutional ethics committee and approved 
by the relevant authorities of the Canton of Geneva. C57BL/6 or heterozygous 
BAC transgenic female and male mice, in which eGFP expression was driven by 
either DIR (drdla-eGFP) or D2R (drd2-eGFP) gene regulatory elements were 
injected intraperitoneally with saline or cocaine. We then prepared brain slices 
for electrophysiological recordings as previously described? or placed the mice in 
an apparatus to quantify locomotor behaviour. Locomotor sensitization or 
synaptic plasticity was monitored at the various time points after the cocaine 
injection. A two-injection or five-injection protocol to induce locomotor sensitiza- 
tion was used as previously described”. Standard surgical procedures’ were used to 
infect mice with ChR2-AAV or a control AAV (0.5 tl) in the infralimbic medial 
prefrontal cortex while the light guides were aimed at both nucleus accumbens 
(shell). In depotentiation experiments, a 473-nm solid-state laser was used to carry 
out the in vivo light stimulation protocol in awake mice (600 pulses of 4ms 
duration at 1 Hz, 10-20 mW), 45 min or 5 days before behavioural testing or ex 
vivo electrophysiology recordings, respectively. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Mice. C57BL/6 or heterozygous BAC transgenic mice, in which eGFP expression 
was driven by either D1R (drdla-eGFP) or D2R (drd2-eGFP) gene regulatory 
elements were backcrossed*' in C57BI/6 mice for three to four generations, were 
used. Mice were housed in groups of 3-4 except for those implanted with guide 
cannulae, in which case animals were housed separately. All animals were kept ina 
temperature- and hygrometry-controlled environment with a 12 h light/12h dark 
cycle. Mice were injected intraperitoneally with 20 mgkg | cocaine, 40 mgkg ' 
SL327 (dissolved in 25% DMSO) or 0.9% saline (injection volume 10 ml kg). 
Immediately after injection, mice were placed in the locomotor recording 
apparatus for 1h. All procedures were approved by the Institutional Animal 
Care and Use Committee of the University of Geneva. 

Locomotor sensitization. Locomotor activity was measured as the number of 
quarter turns entirely crossed by a mouse in a circular corridor. Locomotor chamber 
apparatus was placed under a video tracking system (Any-maze, Stoelting) and 
measurements were made automatically by the software. After 3 days of habituation 
to the test apparatus, mice underwent the experimental procedure, which consisted 
of two sessions of 60 min separated by 1 week (or 1 month), called day 1 and day 8 
(or day 30). During the day 1 session, mice received saline or cocaine and were 
placed immediately in the corridor for 60 min. One week or 1 month later (day 8 or 
day 30), a second session was performed during which all mice were injected with 
cocaine before being placed in the circular corridor for 60 min. The light stimulation 
protocol (600 pulses of 4ms at 1 Hz) was done 45 min before the second cocaine 
injection. To compare the effects of various times after the first injections or various 
virus infections, locomotor activity in response to the second cocaine injection was 
normalized to the mean locomotor activity of saline-pre-treated mice and the 
sensitization index was calculated by dividing the normalized locomotor response 
to the second injection by the normalized response to the first injection. Locomotor 
sensitization was also evaluated during challenge sessions that followed a chronic 
treatment (5 days of cocaine 15mgkg ', 10 days withdrawal). The light stimu- 
lation protocol was done 45 min or 5 days before the challenge injection of cocaine. 
Virus stereotaxic injection of ChR2-AAV or control AAV. AAV] viruses pro- 
duced at the University of North Carolina (Vector Core Facility) were injected into 
the infralimbic cortex of 15-20 g wild-type or BAC transgenic mice. Anaesthesia 
was induced and maintained with isoflurane (Baxter AG). The animal was placed in 
a stereotaxic frame (Angle One) and craniotomies were performed using stereotaxic 
coordinates (anterio-posterior, + 1.9; medio-lateral, +0.3; dorso-ventral, 2.4-2.6). 
Injections of AAV1 viruses (0.5 pl) were carried out using graduated pipettes 
(Drummond Scientific Company), broken back to a tip diameter of 10-15 um, at 
arate of ~0.05 il min’ '. In all experiments the viruses were allowed a minimum of 
3 weeks to incubate before any other procedures were carried out. As a control, 
some mice were injected with an AAV containing only GFP. 

Cannula implantation. Following anaesthesia and craniotomy over the nucleus 
accumbens, two holes were drilled around the craniotomy and screws were placed 
in the holes. Two weeks after viral injections, guide cannulae (Plastics One) were 
lowered slowly into position using stereotaxic coordinates (bilaterally anterio- 
posterior, +1.5; medio-lateral, + 1.6; dorso-ventral 4.1; 15° angle) and cemented 
in place using dental cement (Lang Dental MFG Company) to encase the base of 
the guide cannulae and the screws. Once the cement had dried, a dummy cannula 
(Plastics One) was placed inside each guide cannula to prevent infection. 

Slice electrophysiology. Coronal 200-250-um slices of mouse forebrain were 
prepared in cooled artificial cerebrospinal fluid (ACSF) containing (in mM): 
NaCl 119, KCI 2.5, MgCl 1.3, CaCl, 2.5, Nag HPO, 1.0, NaHCO; 26.2 and glucose 
11, bubbled with 95% O, and 5% CO). Slices were kept at 32-34 °C in a recording 
chamber superfused with 2.5mlmin ' ACSF. Visualized whole-cell voltage- 
clamp recording techniques were used to measure holding and synaptic responses 
of MSNs of the nucleus accumbens shell, identified in some experiments by the 


presence of the GFP of BAC transgenic mice by using a fluorescent microscope 
(Olympus BX50WI, fluorescent light U-RFL-T). The holding potential was 
—70mV, and the access resistance was monitored by a hyperpolarizing step of 
—14mV with each sweep, every 10s. Experiments were discarded if the access 
resistance varied by more than 20%. Synaptic currents were evoked by stimuli (50- 
100 ps) at 0.1 Hz through bipolar stainless steel electrodes placed at the cortex- 
nucleus-accumbens border. The internal solution contained (in mM): 140 k-glu- 
conate, 5 KCI, 130 CsCl, 10 HEPES, 0.2 EGTA, 2 MgCls, 4 Na,ATP, 0.3 Na;GTP 
and 10 sodium creatine-phosphate. Currents were amplified (Multiclamp 700B, 
Axon Instruments), filtered at 5kHz and digitized at 20kHz (National 
Instruments Board PCI-MIO-16E4, Igor, WaveMetrics). The liquid junction 
potential was small (—3 mV), and therefore traces were not corrected. All experi- 
ments were carried out in the presence of picrotoxin (100 1M). LTP was induced 
by using the following HFS protocol: 100 pulses at 100 Hz repeated 4 times at 
0.1 Hz paired with depolarization at 0 mV**”. 

Miniature EPSCs were recorded in the presence of tetrodotoxin (0.5 1M). The 
frequency, amplitudes and kinetic properties of these currents were then analysed 
using the Mini Analysis software package (v.4.3, Synaptosoft). Cocaine or light 
illumination of ChR2-induced changes in cumulative miniature EPSC amplitude 
and inter-event interval distribution were analysed for statistical significance using 
the nonparametric two-sample Kolmogorov-Smirnov test (KyPlot) with a con- 
servative critical probability level of P< 0.05. 

Paired pulse ratio (PPR) was calculated by dividing the second evoked EPSC by 
the first with a 50-ms interval in between. 

AMPAR EPSCs evoked with ChR2 stimulation by 4-ms light pulses (LED, 

Thorlabs) were recorded in the same conditions as electrically evoked synaptic 
currents. Low-frequency stimulation (LFS; 1 Hz for 10 min) was applied with light 
pulses and the magnitude of LTD was determined by comparing average EPSCs 
that were recorded 20-30 min after induction to EPSCs recorded immediately 
before induction. 
In vivo stimulation of infralimbic cortex projections in the nucleus accumbens 
shell. Virus injected and cannulated animals were allowed a minimum of 1 week to 
recover and to express the virus. 473-nm solid-state lasers (GMP, CH) were used to 
carry out the in vivo stimulation protocol in awake mice. A fibre optic (Thorlabs) 
was customized to enable the mouse to move freely during stimulation. Briefly, the 
plastic cap of a dummy cannula (Plastics One) was hollowed out and a hole of 
sufficient diameter for the fibre optic to pass through made in the top. This was 
threaded onto the fibre optic, one end of which was stripped to leave a 200 pm 
external diameter. The fibre was then lowered into the guide cannula on the mouse 
and the hollowed-out dummy cannula cap screwed onto the guide cannula. An 
FC/PC rotative fibre-optic rotary joint (Doric lenses) was used to release torsion in 
the fibre caused by the animal’s rotation. The fibre was connected to the laser, 
which delivered 4 ms pulses at 1 Hz for 10 min, an established LTD protocol at 
excitatory synapses in the nucleus accumbens™. All stimulations were carried 
out in the mouse home cage (except in the experiment shown in Supplementary 
Fig. 10, in which stimulation was performed during locomotor recordings in the 
circular corridor) 45 min or 5 days before behavioural testing or ex vivo electro- 
physiology recordings. 
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Modulation of TRPA1 thermal sensitivity enables 
sensory discrimination in Drosophila 


Kyeongjin Kang'*, Vincent C. Panzano'*, Elaine C. Chang’, Lina Ni', Alexandra M. Dainis', Adam M. Jenkins”, Kimberly Regna’, 


Marc A. T. Muskavitch?*** & Paul A. Garrity! 


Discriminating among sensory stimuli is critical for animal sur- 
vival. This discrimination is particularly essential when evaluating 
whether a stimulus is noxious or innocuous. From insects to 
humans, transient receptor potential (TRP) channels are key trans- 
ducers of thermal, chemical and other sensory cues’”. Many TRPs 
are multimodal receptors that respond to diverse stimuli’*, but 
how animals distinguish sensory inputs activating the same TRP is 
largely unknown. Here we determine how stimuli activating 
Drosophila TRPA1 are discriminated. Although Drosophila 
TRPA1 responds to both noxious chemicals* and innocuous warm- 
ing’, we find that TRPAl-expressing chemosensory neurons 
respond to chemicals but not warmth, a specificity conferred by 
a chemosensory-specific TRPA1 isoform with reduced thermo- 
sensitivity compared to the previously described isoform. At the 
molecular level, this reduction results from a unique region that 
robustly reduces the channel’s thermosensitivity. Cell-type 
segregation of TRPAI activity is critical: when the thermosensory 
isoform is expressed in chemosensors, flies respond to innocuous 
warming with regurgitation, a nocifensive response. TRPAI 
isoform diversity is conserved in malaria mosquitoes, indicating 
that similar mechanisms may allow discrimination of host-derived 
warmth—an attractant—from chemical repellents. These findings 
indicate that reducing thermosensitivity can be critical for TRP 
channel functional diversification, facilitating their use in contexts 
in which thermal sensitivity can be maladaptive. 

Highly temperature-responsive TRP cation channels, thermoTRPs, 
mediate thermosensation from insects to mammals’ and are import- 
ant for human pain and inflammation’®. Like mammalian thermoTRPs, 
Drosophila melanogaster TRPA] is both a thermal and chemical sensor, 
responding to innocuous warmth (above ~25-27 °C)°” and noxious 
chemicals*. TRPA1 acts in thermosensors within the brain to modulate 
thermal preference over 18-32 °C (ref. 5), innocuous temperatures 
compatible with fly survival’, and in gustatory chemosensors to inhibit 
ingestion of electrophiles*, reactive chemicals like allyl isothiocyanate 
(AITC, found in wasabi) and N-methylmaleimide (NMM) that rapidly 
incapacitate flies (Supplementary Fig. 1). The responsiveness of TRPA1 
to both innocuous and noxious stimuli raises the question of how these 


Figure 1 | TRPA1-dependent gustatory neurons do not respond to heat. 

a, TRPA1 immunostaining of wild-type (top) and TrpAl'”® (bottom) labella. 
Right, differential interference contrast overlay of labellar structures. 
Arrowheads, chemosensor cell bodies. Scale bar, 20 um. b, Top, Drosophila 
gustatory organs: LSO, labral sense organ; OES, oesophagus. Bottom, labellar 
bristles. Brown, s-type; grey, L-type; black, i-type; berberine-sensitive bristles 
were targeted for electrophysiology. c, d, Bristle responses to: berberine (1 mM) 
and NMM (10 mM) (c); warming (d). Temp., temperature. e, Average spike 
rate after subtracting electrolyte-only baseline. freq., frequency. **P < 0.01; NS, 
not significant (P > 0.05), t-test. All data are mean + standard error of the 
mean (s.e.m.). Warming reached an average maximum temperature of 

39.0 + 0.6 °C (mean = standard deviation). 


stimuli are distinguished to elicit distinct behavioural responses. 
Mammals face similar issues; for example, TRPM8 transduces both 
innocuous and noxious cold’. 

We previously reported TRPA1-expressing chemosensors in the 
labral sense organ*; using improved immunostaining conditions, we 
now also detect specific TRPA1 protein expression in labellar chemo- 
sensors (Fig. 1a, b). Extracellular tip recording? indicated these neurons 
were TRPA1-dependent chemosensors; they responded to the electro- 
phile NMM with robust spiking in wild type but not TrpA1 mutants 
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(Fig. 1c and Supplementary Fig. 2). The mutant defect was electrophile 
specific, as TrpAl mutants responded like wild type to berberine 
chloride (Fig. 1c), a bitter compound that also activates these neurons”®. 
In contrast, warming to ~39 °C, from innocuously warm to the noxious 
range, elicited no spiking in these cells (Fig. 1d, e). This is notable as the 
effectiveness of TRPA1 in conferring warmth sensitivity has led to its use 
as a thermogenetic tool*’’. Thus, despite the known sensitivity of 
TRPAI to both temperature and chemicals, these chemosensors are 
warmth insensitive. 

In addition to the previously characterized transcript, TrpA1(B), a 
transcript with an alternative 5’ end, TrpA1(A), has been annotated!” 
(Fig. 2a). These transcripts encode protein isoforms with distinct 
amino termini, but the same ankyrin and transmembrane domains 
(Fig. 2b). Polymerase chain reaction with reverse transcription (RT- 
PCR) demonstrated differential expression: TrpA1(A) was expressed 
in the proboscis, which houses the TRPA1-expressing chemosensors, 
whereas TrpA1(B) predominated elsewhere in the head, where 
TRPA1-expressing thermosensors are located (Fig. 2c). 

Examined in Xenopus oocytes, TRPA1(A) was much less thermo- 
sensitive than TRPAI(B), as reflected in its temperature coefficient 
(Qio), the fold change in current per 10 °C change’. Arrhenius plot 
analysis’* yielded a Qio of ~9 for TRPAI(A) versus ~116 for 
TRPA1(B) (Fig. 2d-f). In addition, whereas TRPA1(B) was essentially 
inactive at low temperatures, TRPA1(A)-dependent currents were 
observed =15°C, further reducing the temperature-dependent activity 
differential of TRPA1(A) (Supplementary Fig. 3). The maximum heat- 
activated current for TRPA1(A) was also significantly lower (Sup- 
plementary Fig. 4). Lastly, the transition (or threshold) temperature 
for increased temperature responsiveness was 29.7+0.3°C for 
TRPAI(A) versus 27.8 + 0.4°C for TRPAI(B) (P<0.01, t-test). As 
the innocuous warm temperature range in Drosophila is of particular 
behavioural relevance, the Qj from 27-37°C (below the ~38°C 
nociceptive threshold in Drosophila'*) was also calculated, yielding 


6.2+0.5 for TRPAI(A) and 90+ 8 for TRPA1(B) (Fig. 2f). Other 
properties were largely unaffected; both channels responded robustly 
to electrophiles and had similar voltage sensitivities (Fig. 2g,h). TRPA1(A) 
and TRPA1(B) had similar maximum current amplitudes at 300 UM 
NMM, with half-maximum effective concentration (EC;9) values of 
176 + 12 and 128 + 9 uM, respectively (Fig. 2i). 

The reduced thermosensitivity of TRPA1(A) could account for the 
chemosensors’ warmth insensitivity. But although TRPA1(A) is less 
temperature sensitive than TRPA1(B), its Qig resembles several TRPs 
implicated in warmth sensitivity’. To assess whether TRPA1(A) 
could confer warmth sensitivity upon Drosophila chemosensors, each 
isoform was used to rescue a TrpAl mutant. We previously demon- 
strated that expressing TRPA1(B) in TRPA1-dependent chemosensors 
using Gr66a-Gal4 rescues the T7pA 1 mutant behavioural defect*. Using 
electrophysiology, we found both isoforms restored NMM responsive- 
ness (Fig. 3a and Supplementary Fig. 5a, b), but only TRPA1(B) con- 
ferred warmth sensitivity (Fig. 3b, c). These differences did not require 
properties unique to TRPA1-dependent chemosensors. Each isoform 
was expressed ectopically in sweet-responsive chemosensors using 
Gr5a-Gal4 (ref. 18). Both isoforms conferred electrophile sensitivity 
upon these normally electrophile-insensitive neurons, but only 
TRPAI(B) conferred thermosensitivity (Fig. 3d-f and Supplemen- 
tary Fig. 4c, d). The inability of TRPA1(A) to confer warmth sensitivity 
on fly chemosensors emphasizes that although a Qig above 5 makes 
TRPA1(A) more thermally sensitive than most ion channels, in vivo 
testing is important in evaluating whether a channel is sufficiently 
thermosensitive to make a specific neuron warmth responsive. 

These data support a model in which the specificity of TRPA1- 
expressing gustatory neurons for chemicals is established by their 
selective expression of TRPA1(A), an isoform unable to confer warmth 
sensitivity. In contrast, the chemical sensitivity of TRPA1(B) should 
render TRPA1-dependent thermosensors sensitive to reactive chemicals. 
However, the location of TRPA1-dependent anterior cell thermosensors 
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Figure 2 | TRPA1 isoform diversity yields tissue-specific channels with 
different thermal sensitivities. a, TrpA1 gene structure and primer locations. 
b, TRPA1 proteins. Dark red and blue boxes denote isoform-specific sequences. 
Dark grey, transmembrane (TM) region. a, ankyrin repeat; aa, amino acids. 
c, RT-PCR analysis of dissected tissue. d, e, TRPA1(A)- and TRPA1(B)- 
dependent currents (d) and Arrhenius plots (e) in oocytes. T,,, transition 
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temperature. f, Qi values from Arrhenius plot (left) or 27-37 °C (right). 

g, h, Left panels, NMM responsiveness of TRPA1(A) (g) and TRPA1(B) 

(h). Right panels, I-V relationships at points marked at left. i, Mean amplitudes 
at 300 1M NMM (left) and NMM dose-response (right). All data, 

mean + s.e.m. **P < 0.01; NS, not significant, t-test. 
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Figure 3 | TRPA1 isoform diversity determines sensory specificity of 
gustatory neurons. a-c, TrpA1 mutant, berberine-sensitive i-type bristles 
expressing different TRPA1 isoforms. a, b, Electrophysiological responses to 
NMM (a) and warming (b). c, Quantification. d-f, L-type bristles expressing 
TRPAI isoforms. Responses to NMM (d) and warming (e). f, Quantification. 
g, Rescue of TrpA1 mutant behavioural response to NMM-containing food. 


inside the head* should minimize exposure to environmental irritants. 
Interestingly, multiple TRPV1 and TRPM1 isoforms are present in 
humans and other mammals*'”’*”*, suggesting the potential generality 
of isoform diversity in modulating TRP functions. 

The behavioural significance of discriminating noxious from 
innocuous TRPAI activators was examined by testing gustatory res- 
ponses of TrpA1 mutants rescued by chemosensor expression of each 
isoform. TrpAl mutants exhibit decreased avoidance of reactive- 
electrophile-containing food*. Each isoform rescued this behaviour 
(Fig. 3g). However, TRPA1(B) also triggered a nocifensive response 
to innocuous warming. When allowed to ingest water to satiation and 
warmed to ~32 °C, neither wild-type nor TRPA1(A) rescue animals 
showed detectable gustatory responses (Fig. 3h, i). However, warming 
TRPA1(B) rescue flies caused ~75% to regurgitate (Fig. 3h, i and Sup- 
plementary Movie 1). Thus, substituting TRPA1(B) for TRPA1(A) in 
chemosensors disrupts discrimination of noxious from innocuous 
stimuli and demonstrates the negative behavioural consequence of 
misregulated thermosensitivity. 

To probe how the alternative N termini in TRPA1 confer distinct 
properties, conserved residues within these regions were mutated 
(Fig. 4a). Mutating either a cysteine (C105) or two basic residues 
(R113, R116) in TRPA1(A) markedly increased temperature respon- 
siveness (Fig. 4a—c and Supplementary Figs 6 and 7). Whereas the Qio 
of wild-type TRPA1(A) was <10, the TRPA1(A) mutants exhibited 
Qio values of >50 (Fig. 4b, c), greater than the reported Qiy values of 
canonical thermoTRPs like TRPM8 (~24)?! and TRPV1 (~40)”. In 
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PER, proboscis extension response. h, Warmth-induced regurgitation in TrpA1 
mutant rescued with TRPA1(B). i, Regurgitation upon warming from room 
temperature (~23 °C) to 32 °C. Ing, f, g and i, statistically distinct groups are 
marked by a, b and c. (Tukey honestly significant difference (HSD) test, 

a= 0.01). Data are mean + s.e.m. 


addition, the TRPA1(A) mutants conducted little current below the 
threshold, increasing the temperature-dependent activity differential 
(Fig. 4c and Supplementary Fig. 8). The enhanced sensitivities of the 
mutants seemed to be temperature specific, as NMM sensitivity was 
not increased (Supplementary Fig. 8). These data indicate that 
TRPAI(A) retains all the requirements for robust thermosensation, 
but contains a modulatory region preventing those elements from 
exerting their full effect. 

For TRPA1(B), mutating either a conserved tryptophan or two basic 
residues in the N terminus yielded channels retaining robust thermo- 
sensitivity (Qi9 > 50; Fig. 4b and Supplementary Fig. 6). The thresholds 
of the TRPA1(A) and TRPA1(B) mutants were all ~30-34 °C, within 
the innocuous warm range but above the ~28 °C value for wild-type 
TRPAI(B) (Fig. 4b, c). Thus, although TRPA1(B)-specific sequences 
are unnecessary for robust responsiveness to innocuous warming, they 
may tune channel threshold within this range. 

In insect disease vectors, TRPA1 orthologues have been implicated 
in detecting both warmth and chemical repellents*”***, cues with 
opposing effects on host seeking. We found the malaria mosquito 
Anopheles gambiae also contains TRPA1(A) and TRPA1(B) isoforms 
of differing thermosensitivity (Fig. 4d-h). In oocytes, the Qio of 
A. gambiae TRPA1(A) was ~4 versus a Qio of ~200 for A. gambiae 
TRPA1(B); from 27-37 °C, the Qip of A. gambiae TRPA1(A) was ~2 
versus ~60 for A. gambiae TRPA1(B) (Fig. 4h). A. gambiae TRPAI(A) 
yielded lower maximum heat-induced current than A. gambiae 
TRPAI(B) (Supplementary Fig. 4) and had a higher threshold 
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Figure 4 | Regulation of insect TRPAI thermosensitivity by alternative N 
termini. a, TRPA1 sequence alignments. Acyr. pisum, Acyrthosiphon pisum; 
Aedes ag., A. aegypti; Anopheles g., A. gambiae; Dros. mel., D. melanogaster; 
Dros. vir., D. virilis. b, Qo values and transition temperatures (Trans. temp.) of 
wild-type (WT) and mutant TRPA1. Dm, D. melanogaster. a, b and c denote 
statistically distinct groups (Tukey HSD, « = 0.02). c, Arrhenius plots of 
indicated channels. d, A. gambiae (Ag)TrpA1 gene structure. e, AgTRPA1 


(34.2 + 1.8 °C versus 25.2 + 0.9°C, P< 0.01). A. gambiae TRPAI(A) 
also exhibited significant conductance below threshold (Fig. 4f, g). 
Both channels responded to electrophiles (Supplementary Fig. 9). 
TRPAI(A) and TRPA1(B) are conserved in other haematophagous 
insects including Aedes aegypti and Culex quinquefasciatus mosquitoes 
and Pediculus humanus corporis lice (Fig. 4a and Supplementary 
Fig. 10), which transmit dengue, West Nile fever and typhus, respec- 
tively. The functional diversity of TRPA1 provides a potential explana- 
tion for how insect vectors discriminate noxious chemicals from 
host-derived warmth, indicating that TRPA1 presents two distinct 
molecular targets for disrupting pest behaviour. 

TRPA1-based electrophile detection appears to have emerged =500 
million years ago in a common vertebrate/invertebrate ancestor’. 
However, the larger TRPA family extends to choannoflagellates, 
separated from animals =600 million years*. As divergent TRPA clades 
contain highly temperature-sensitive channels’, thermosensitivity may 
be ancestral. In this scenario, the specialization of TRPA1 for noxious 
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isoforms. a, ankyrin repeat; aa, amino acids. Blue and dark red indicate isoform- 
specific amino acids. Dark grey indicates transmembrane (TM) region. 

f-h, Temperature sensitivity of AgTRPA1(A) and AgTRPA1(B). f, g, Traces 
(f) and Arrhenius plots (g) of temperature-dependent current recordings at 
—60 mV in Xenopus oocytes. h, Qio values from Arrhenius plot (left) or 27- 
37 °C (right) (**P < 0.01, t-test). Error bars, + s.e.m. 


chemical detection would necessitate reducing thermosensitivity, con- 
sistent with the effect of the N terminus in TRPA1(A). The ability of 
N-terminal variation to sculpt channel properties is intriguing as the N 
terminus is the most divergent region of TRPA1 within insects and from 
insects to mammals’. 

TRPs are a large family of channels, with 27 human and 13 
Drosophila members, which vary greatly in thermosensitivity and 
function’. Considerable diversity is evident even among closely related 
TRPs. In mammals, for example, TRPM8 (Qi9 ~24; ref. 21) mediates 
thermosensation*, whereas the less thermosensitive TRPM4 and 
TRPM5 (Qj) ~8.5-10; ref. 15) mediate insulin secretion*? and 
TRPM7 (with no reported thermal sensitivity) is implicated in ion 
homeostasis’. The mechanisms underlying such diversification are 
unclear. Whereas studies of thermal sensing by TRPs have focused 
on identifying regions promoting thermosensitivity'’*”’, our work 
indicates that regions reducing thermosensitivity are also critical. 
Here we find that selectively reducing the thermosensitivity of 


Drosophila TRPAI facilitates its use in a context in which thermosen- 
sitivity is undesirable. Similar mechanisms could mediate functional 
diversification not only among isoforms of a single TRP, but also 
contribute to the remarkable functional diversification observed 
between different TRP family channels. 


METHODS SUMMARY 

Fly strains and immunohistochemistry. UAS-TrpA1(B) and Gr66a-Gal4 trans- 
genic strains and the TrpA1'”* mutant have been described’. The UAS-TrpA1(A) 
transgene was amplified from fly complementary DNA with an isoform-specific 
primer (5’-TATAAAGCTTAAGCCACCATGATTACAGCTCCGGCCACGGC 
CA-3') and a reverse primer (5’-GAGACTCGAGCTACATGCTCTTATTGA 
AGCTCAGGGCG-3’). As detailed in Methods, the UAS-TrpA1(A) transgene 
was inserted in the same genomic location used for the UAS-TrpA1(B) transgene 
to control for transgene position effects. Anti- TRPA1 immunohistochemistry was 
as described’, except that the secondary antibody was incubated for 3 days. 
Behaviour. The proboscis extension assay was conducted as previously 
described’, with seven flies per experiment, three experiments per genotype. For 
heat-sensitive regurgitation, >20 flies per genotype (2-3 days old) were starved 
overnight with water, then glued to glass slides. After 2~3h recovery, flies were 
satiated with water. Only flies drinking longer than 5 s were tested. Drinking times 
did not significantly differ between wild-type and rescue flies (E.C.C. and P.A.G., 
unpublished data), consistent with similar ingestion behaviours. Flies were heated 
with a radiant heater at 800 W (H-4438, Optimus) and temperature was monitored 
by adjacent thermocouple microprobe (IT-23, Physitemp Instruments) wrapped 
in fly cuticle. 

Physiology. Oocyte physiology was performed as described’, with additional 
details provided in Methods. Extracellular recordings of gustatory neurons were 
obtained by tip recording’, as detailed in Methods. 

Molecular biology. cDNA was prepared from dissected D. melanogaster tissue 
(RETROscript, Ambion) and subjected to PCR with three primers, two forward 
isoform-specific primers and a reverse common primer detailed in Methods. Each 
primer was designed to straddle a splice junction, minimizing amplification of 
contaminating genomic DNA. In all cases, similar results were obtained from four 
independent tissue preparations. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Fly strains and immunohistochemistry. The UAS-TrpA1(A) transgene was 
inserted into the genome by site-specific transgenesis*® at the same landing site 
as UAS-TrpA1(B), attp16 (ref. 31). 

Behavioural analysis. Chemicals used in incapacitation assays were sucrose 
(Calbiochem LC8510), sorbitol (Sigma S-1876), ficoll (Sigma F-4375), agarose 
(Invitrogen 15510-027), caffeine (Sigma C0750), NMM (Sigma 389412), isopropanol 
(100%, J. T. Baker 9083-03), ethanol (100%, Decon Lab 2716) and allyl isothio- 
cyanate (95%, Sigma 377430). 

Characterization of TRPA1 isoforms in Xenopus oocytes. TRPA1 currents 
were recorded as described*”. To evaluate temperature sensitivities, oocytes were 
perfused in the recording buffer (96 mM NaCl, 1 mM MgCl, 4mM KCl, and 5 mM 
HEPES, pH 7.6), the temperature of which was increased ~0.5 °C s | from 10 to 
45 °C by SC-20 in-line heater/cooler (Warner Instruments) with a CL100 bipolar 
temperature controller (Warner Instruments). Temperature-evoked current was 
recorded at —60mV. From the recorded current, Qj) was calculated as 
described'*!°. Arrhenius Qyg = 10(1(~Ss)/(T: 7), where Sarrhe is the slope of linear 
phase of an Arrhenius plot between absolute temperatures, T; and T>. Transition 
temperature was assessed as the temperature at which the least-squares fit lines 
from the two linear phases intersect’*’®. Qi from 27-37 °C was calculated from 
currents at temperatures of interest using the equation, Qiy =(h [hy Bt), 
where J,and I, are currents observed at temperatures of T; and T>, respectively. 
Qio determinations were validated by using Crotalus atrox TRPA1 (ref. 16) as a 
control with known Qio (K.K. and P.A.G., unpublished data). To assess sensitivity 
to NMM, voltage across the membrane was initially held at —80 mV, and a 300-ms 
voltage ramp (—80 mV to 80 mV) per second was applied. The oocytes were per- 
fused for 1 min with the recording buffer containing indicated concentrations of 
NMM with 30-s washes between NMM applications. Current amplitudes at 
—80mV after application of each NMM concentration were fitted to the Hill 
equation through Sigmaplot 10. The first coding exon of A. gambiae TrpA1(B) 
was chemically synthesized (Genscript). 

Gustatory neuron electrophysiology. Extracellular recordings of gustatory 
neurons were obtained using the tip-recording method’. Adult female flies, aged 
1-4 days, were prepared by inserting a glass reference electrode containing 
Drosophila Ringer’s solution into the thorax and advancing the electrode through 
the head to the labellum. A glass recording electrode with an ~ 15-,1m opening was 
used to apply tastants to individual sensilla. Raw signals were amplified using a 
TasteProbe preamplifier (Syntech) and were digitized and analysed using a 
PowerLab data-acquisition system with LabChart software (ADInstruments). 
Amplified signals were digitized at a rate of 20kbs ' and filtered using a 100- 
3,000 Hz band-pass filter before analysis. Individual action potentials were sorted 


using a visually adjusted threshold and average spike rate was calculated beginning 
200 ms after electrode contact. Recording times varied by experiment: berberine 
chloride and sucrose positive controls, 5 s; electrolyte only, 20 s; NMM on i-type 
bristles, 60 s; NMM on L-type bristles, 120s; heat ramps, >60s. For heat-ramp 
experiments, recordings were performed using electrolyte only as tastant. After 
~30s of recording to determine baseline activity, heat was applied manually to the 
fly using a radiant heater (PRESTO HeatDish, National Presto Industries). 
Application of heat was maintained for ~10-30s and the distance between the 
heat source and the preparation was reduced to obtain a temperature of =39 °C. 
Bristle temperature was estimated using thermocouple microprobe (IT-23, 
Physitemp Instruments) wrapped in fly cuticle. All tastants were dissolved in 
30 mM tricholine citrate as the electrolyte to inhibit the activity of the water cell 
in L-type bristles’. Tastants were stored at —20 °C and aliquots maintained at 4 °C 
for up to 1 week. For all experiments, a positive control was used to confirm the 
viability of the target bristle. For i-type bristles, 1 mM berberine chloride was used 
as control. For L-type bristles, 30 mM sucrose was used. Individual tastant pre- 
sentations were separated by a minimum delay of 60 s. At least two animals and six 
bristles were examined for each condition. 

Molecular biology. Primers for RT-PCR reactions: D. melanogaster TrpA1(A) 
forward, 5'-GCCGGAACAGCAAGTATT3-3’; D. melanogaster TrpA1(B) forward, 
5'-GTGGACTATCTGGAGGCG-3’; D. melanogaster TrpA1 common reverse, 
5'-TATCCT TCGCATTAAAGTCGC-3’. 

Mutagenesis of Drosophila TRPA1 was performed as described’. Briefly, for a 
desired mutation, each of two mutually complementary mutant primers was 
paired for PCR with a primer (outer primer) that anneals outside of either Sall 
or Hpal restriction recognition site. The two resulting PCR fragments that overlap 
only in the region of the two mutant primers were combined and served as 
template for the next PCR reaction that contained only outer primers. The second 
PCR product was digested by SalI and Hpal, and subsequently replaced the cor- 
responding wild type region of TRPA1 cDNA. The fragment between the two 
restriction sites was sequenced. Sequences were aligned using MUSCLE 3.7”. 
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